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METHODS AND COMPOSITIONS 
FOR CELL ULAR AND METABOLIC ENftTNKTrPTMit: 

This application is a continuation-in-part of U.S. 
Patent Application Serial No . 03/621 , 859 , filed March 25, 
1996, Serial No. 08/621,430, filed March 25, 1996, Serial No. 
08/425,684 filed, April 18, 1995 and Serial No. 08/650,400, 
filed May 20, 1996, che specifications of which are herein' 
incorporated by reference in their entirety for all purposes. 

Backgro und of the Invention 
Metabolic engineering is the manipulation of 
intermediary metabolism through the use of both classical 
genetics and genetic engineering techniques. Cellular 
engineering is generally a -ore ' inclusive Cera' referring to 
the modification of cellular properties. Cameron et 'al . 
(Appli ed 3iochem. Biotech. 3H:-Q=;-un (1993)) provide a 
summary of equivalent terms to describe this type of 
engineering, including "metabolic engineering", which is most 
titer, used in the context cf industrial microbiology and 
bicprocess engineering. - ir. vi zrc evolution" or "directed 
25 evolution", most often used m the context of environmental' 

--crobioiogy, "molecular breeding", most often used by... 

Japanese researchers, "cellular engineering", which is used to 
describe modifications of bacteria, animal, and plant cells, 
"rational strain development", and "metabolic pathway 
30 evolution". In. this application, the terms "metabolic 
engineering" and "cellular engineering" are used 
preferentially for clarity,- che term "evolved" genes is used " 
as discussed below. 

Metabolic engineering can be divided into two basic 
35 categories.- modification of genes endogenous to the host 

organism to alter metaooiite flux and introduction of foreign 
genes into an organism Such introduction can create new 



3.-.SCOCI0- - 'WO 



WO 97/35966 



PCTAJS97/04715 



2' 



t he 



metabolic pathways leading to modified cell properties 
including' but, not limited to synthesis of known compounds not 
normally made by the host cell, production of novel compounds 
(e.g. polymers, ■ antibiotics, etc.) and the ability to utilize 
'new nutrient sources. Specific applications of metabolic 
engineering can include the production of specialty and novel 
chemicals, including antibiotics, extension of the. range of 
substrates used for growth and. product formation, the 
production of new catabolic activities in an organism for 
toxic chemical degradation, "and modification of cell 
properties such as resistance to 'salt and other environmental 
factors . 

Bailey ( Science 252:1668-1674 (1991)) describes' the 

application of metabolic engineering to the recruitment of 

'heterologous genes for the improvement;' of .a strain, wit 

caveat that such introduction can result in new compounds that. 

nav subsequently undergo further reactions . ; or that expression 

of a heterologous protein can result in proteolysis, improper 

folding, improper modification, or unsuitable intracellular 

location' of the protein, or lack of access to required 

substrates. " Bailey recommends careful configuration of a ' 

desired: genetic 'change with minimal perturbation of the host. 

Liao t rurr. Coir 3iotech. 4:211-216 (1993)) 

' reviews mathematical modelling and analysis of metabolic • ; 

pathways, pointing out that in many cases the kinetic 

parameters of enzymes are unavailable or inaccurate. 

Stephanopoulos et ai. (Trends, oiotechnol. 11:392- 

396 (1993)) describe attempts to improve productivity of 

cellular systems or effect radical alteration of the flux . 

through orima-ry metabolic pathways as having difficulty in . ' 

that control architectures a-' key branch points have evolved 

■to resist flux changes. They conclude that identification and 

.. ■ • .p 0 , s -o-a'-rV ; c nodes is a prerequisite cc 

cnaractenzat ion Oi. i_r.es = ...^_c~.j- 

rational metabolic engineering. Similarly, Stephanopoulos 
( Curr. Ooin. Biotech. 5:196-200 (1394)) concludes that rather 
than modifying the "rate limiting step" in metabolic 
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engineering, it is necessary to systematically elucidate the 
control architecture of bioreaction networks. 

The present invention is generally directed to the 
evolution or new metabolic pathways and the enhancement of 
bioprocessing through a process herein termed recursive 
sequence recombination. Recursive sequence recombination 
entails performing iterative cycles of recombination and 
screening or selection to "evolve" individual genes, whole 
plasmids or viruses, multigene clusters, or even whole genomes 
fStemmer < 3io/Technolocy 13:549-553 (1995)). Such techniques 
do not require the extensive analysis and computation required 
by conventional methods for metabolic engineering. Recursive 
sequence recombination allows the recombination of large 
numbers of mutations in a minimum number of selection cycles, 
in contrast to traditional, pairwise. recombination events. 

Thus, because metabolic and cellular engineering 
can pose the particular problem of the interaction of many * 
gene products and regulatory mechanisms, recursive sequence ' 
recombination (RSR) techniques provide particular advantages"' 
in that they provide recombination between mutations in any or 
ail of these, thereby providing a very fast way of exploring/ 
the manner ir. which different combinations of mutations car. 
affect a desired result, whether that result is increased 

d_ — - - — ^: al V^" activity or suostrate 

s?eci::c;:y of, ar.- en:r:o or an en:ire ~etabolic pathway, or 
^ soon s e ct a ce_ to its environment 



* o v o v- , 



Summar y of the Invention 
One aspect of the invention is a method of evolving 
a biocataiytic activity of a cell,, comprising: 

(a) -recombining at least a first and second DNA 
segment from at least one gene conferring ability to catalyze 
.a reaction of interest, the segments differing from each other 
in at least two nucleotide.-. :: produce a library of 
recombinant genes; 

(b) screening at least one recombinant gene from 
the library that confers enhanced ability to catalyze the . 
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.reaction of interest by the cell relative to a wildtype form 
of the gene / 

. .(c) recombining at least a segment 'from at least 
one recombinant gene with-a further DNA segment from at least, 
one .gene, the same or different from. the first and second 
segments, to produce a further '1 ibrary-of . recombinant genes,* 

(d) .screening at least .one further recombinant 
gene from, the further library of recombinant genes that . 
confers, enhanced ability to. catalyze the reaction ■ of interest 
in the cell relative, to a previous recombinant gene;. 

(e) repeating (c) and (d) , as. necessary, until- the 
'further recombinant gene confers a desired level of enhanced, 

ability ' to' catalyze ■ the. -reaction of interest by the 'cell: 
Another aspect of the invention is a method' of 
15'; 1 evolving a ' gene to confer abil i cv tp ; cata iyze a reaction -of 
7 interest., the method comprising: ■ / ' ^ , 

' " . (U recombining at least first' and second DN ? A . 

": ' • segments .from at' least .one gene , conferring Ability to catalyze 
a reaction of interest, the segments differing- from each other 
2 0/ in at least two nucleotides, to produce a library of 
'recombinant genes; ; • ' 

(-2) screening- at. -least _one recombinant ■ gene/' f rom 
the library that" centers enhanced ability to .catalyze . a 
• l ■ , reaction, of interest. relative' to a wildtype form of the gene; 
?z ■ ( 2| • recorbinirvg- at least a segment' from "the at. 

7 ' least one recombinant gene with a further -DNA • segment. . from the 
. at least one gene/ the same, or -different from the firsthand 
7 ' second segments /to produce a -further, library, of. recombinant 

' -genes ; , _ . ■■ 7 ■ • 

■ 3 o •, .7 • (4) screening- at least one further, .recombinant . 

gene from' the 'further, library of 'recombinant genes that ^ '- 
confers enhanced ability to catalyze* a reaction of interest 
relative to^a previous recombinant gene; 

(5) repeating (3) and {4}, as necessary, until the 
35 further recombinant gene confers a desired level of enhanced 
ability to catalyze a reaction of interest. 
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A further aipect of the invention is a method of 
generating a new biocatalytic activity in a cell , ' comprising : 

(1) recombining at least first, and second DNA 
segments from at least one gene conferring ability to catalyz 
a first reaction related to a second reaction of interest, th 
segments differing from each other in at least two 
nucleotides, to produce a library of recombinant genes; 

(2) screening at least one recombinant gene from 
the library that confers a new ability to catalyze the second 
reaction of interest; 

(3) recombining at least a segment from at least 
one recombinant gene with a further DNA segment from the at 
least one gene, the same or different from the first and 
second segments, to produce a further library of recombinant 
genes ; 

(4) screening at least one further recombinant * 
gene from the further library of recombinant . genes that 
confers enhanced ability to catalyze the second reaction of K 
interest in the ceil relative to a previous recombinant gene'v 

(.5) repeating (3! and (4), as necessary, until the 
. further recombinant gene confers a desired level of enhanced 
ability to catalyze the second reaction of interest in tho 
coll. 

Another aspect of the invention is a modified form, 
of a ceil, wherein the modification comprises a metabolic 
pathway evolved by; recursive sequence recombination. 

. - ■ •• -A rur-tner -aspect of "the i'nventioh"is a' method of 
optimizing expression of a gene product, the method 
comprising : 

(1) recombining .at least first and second DNA 
segments from at least one gene conferring ability to produce 
the gene product, the segments differing from each other in at 
least two nucleotides, to produce a library of recombinant 
genes; 

(2) screening at least one recombinant gene from 
the library that confers optimized expression of the gene 
product relative to a wiidtype form of . the gene; 
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(3) recomblning at lease a segment from the at 
least one recombinant gene with a further DNA segment from the 

. . at least one gene, the same or different from the first, and 
= second segments, to produce a further library of recombinant 
5 genes ; 

(4) screening at' least -one further recombinant ' 
gene from the further library of recombinant genes that 
confers optimized ability to produce the gene product relative 
to a previous recombinant gene; 

10 (5) repeating (3) and (4), as necessary, until the 

.. further recombinant gene confers a desired levex of optimized 
ability to express the gene ' product . •. 

A further aspect of the invention is a method of 
evolving a- biosensor for a' compound A • of interest, the method , 

15' comprising: • ' . 

(I! recombining at least f irst and second ' DNA. 
'segments from at lease one.' gene conferring ability ,to detect a 
related compound 3, the segments differing from each other in' 
at lease 'two nucleotides, to produce a library of recombinant 

20 genes ; 

(2) screening at least one recombinant gene from 
the 1'ibrary that confers optimized ' ability to detect compound 
A relative to a wiititype form of the gene; 

(3) recombinir.g at least a segment from the at 
least one recombinant gene with a further DNA segment 'from the 
at least one gene, the same or different from the. first and 
second segments', to produce a further' library of recombinant 



cenes ; , ( 1 

(4) screening at kas: one further re'ecrnbinant 

'30 gene from 'the further library of recombinant, genes that 

confers. optimized ability to detect compound A relative to a 
previous recombinant gene; 

(5) . repeating (2) ar.d (4) . as necessary, until the 
further recombinant gene confers a desired level of optimized 

35 ability to detect ccmpound A. . 
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Brief D escription of the Drawings 
Fig. 1 is a drawing depicting a scheme for in vitro 
recursive sequence recombination. 

Description of th e Specific Embodiments 
The invention provides a number of strategies for 
evolving metabolic and bioprocessing pathways through the 
technique of recursive sequence recombination; One strategy 
entails evolving genes chat confer the ability to use a 
particular substrate of interest as a nutrient source in one 
species to confer either more efficient use of that substrate 
in that species, or comparable or more efficient use of that 
substrate in a second species. Another strategy entails 
evolving genes that confer the ability to detoxify a compound 

' interest in one or r.ore species of organisms. Another 
strategy entails evolving r.ew metabolic pathways by evolving"' 
an enzyme or metabolic pathway for biosynthesis or degradation 
of a compound A related to a compound B for the abilitv to ' : 
biosynthesize or degrade ccmcounc B , either in the host of > 
origin or a new host. A further strategy entails evoivina a 
gene or metabolic pathway for more efficient or optimized' 
expression of a particular metabolite or gene product-. A " 
rurther strategy entails evolving a host/vector system for ' f 
expression of a desired heterologous product. These 
strategies may mvcive using all the genes in a multi-step 
pathway, one or several genes, genes from different organisms, 
.or one. or. .more -fragments' of "a Gene." " " 

The strategies generally entail evolution of 
gene(s) or segment (s) thereof to allow retention of function 
m a heterologous cell or improvement of function 'in a 
homologous or heterologous cell. Evolution is effected 
generally by a process termed recursive seouence 
recombination. Recursive sequence recombination can be 
achieved in. many different formats and permutations of 
formats, as described in further detail below. These formats 
share some common principles. Recursive sequence ' 
recombination entails successive cycles of recombination to 
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•generate molecular diversity, i.e., the creation of a family 
of nucleic acid molecules showing substantial sequence . ■ 
identity to each other but differing in the presence of 
mutations. Each recombination cycle is followed by at least ■ 
one cycle of screening or selection for molecules having a 
desired characteristic. The molecule (s) selected in one round 
form the starting materials' for generating diversity in the 
next round. In any given cycle, recombination can occur in . 
vivo or in vitro. Furthermore, diversity resulting from 
recombination can be augmented in any cycle by applying prior 
methods of mutagenesis (e ,g ./ error-prone. PCR or cassette 
mutagenesis, oassage through bacterial mutator strains, ' 
treatment' with chemical mutagens) to either the substrates tor 
or products of recombination. 

t Formats for P»rnrsive Seq uen t Recombination 

Some formats and ex^pies for recursive sequence . 



ed co as DNA shuffling r or 
cer;dir.c appl icat ions ,. U.S. 



recornbination, sometimes 

v, ^, , ^ u 00 - -:-> iTc'^i b<=»d by the -present: 
molecular breeding , nave ^ee.. ~>y y 

inventors and co-workers, :r. c 

~^;^ ( --/n^" 1 4 30, filed March 25, 

Patent Aoolication ier..: - - / o~ - , i j u , 

„ , v . t>—/:;q-t /r f ?i led February ,17, -1995; ■ 
199 6; Serial No. P^i/u^-/^ 

, vi ^5 1996; Serial No. 

Serial No. 03/62;., S — , .-ec ^, 

, - -> ; o; c o — T o >- Science 
03/193,4 3 1, filed .-eoruary ±/, — ' 

n n c-*— ^ a ^ Gene 164:49-53 (1995) ; 

270:1510 (i9?o) ; S-6 - d --< ^ 

■ . -1-S4 9 -553 ( 1995); Stemmer, Pro: ■ 

v*^ A-ad Sci. .U.S.A.- 91:10747 -10751 (1994); Sterner, Nature 
370-389-3*T^^ et -al-: NaXur^Me^ir^ 2(1):1,3 y 

(isie); Crameri et'al. Nature ^orechnolccy 14:315-319 (1996) 

■ ' ~ *. • ^Vn-^or bv reference -in its entirety fo 

each or wmcn ~s L.vO.p^— - 

all purposes. 

(I) In Vitro 



One format for roc — sive sequence recombination in 

, _ ^ r • ~ - Tpo initial substrates for 

vitro is illustrated m r_-. 

recombination are a pool of rented sequences. The X's in 
Fig. 1, panel A, show where the sequences' diverge . The 
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sequences can be DNA or RNA and can be of various lengths 
depending on the size of the gene or DNA fragment to be 
recombined or reassembled. Preferably the sequences are from 
50 bp to 100 kb. 

The pool of related substrates can be fragmented, 
usually at random, into fragments of from about 5 bp 'to 5 kb 
or more, as shown in Fig. 1, panel B . Preferably the size of 
the random fragments is from about 10 bp to 1000 bp, more 
preferably the size of the DNA fragments is from about 20 bp 
to 500 bp. The substrates can be digested by a number of 
different methods, such as DNAsel or RNAse digestion, random 
shearing or restriction enzyme digestion. The concentration 
of nucleic acid fragments cf a particular length or sequence 
is often less than 0.1 V or 1% by weight of the total nucleic 
acid. The number of different specific nucleic acid fragments 
in the mixture is usually a: least about 100, 500 or 1000. l i 
The mixed population of nucleic acid fragments are '* 
denatured by heating to about B0 C C to 100° C, more preferably; 
from 90° C to 96° C, form s ir.gle.- s tranded nucleic acid ?, 

fragments and then reanneaiec. S ir.gle - stranded nucleic acid 
fragments having regions of sequence identity with other j 
single -stranded nucleic acid fragments can then be reanneaiec - 
oy cooling to -20 3 C to 7S C C, and preferably from 40°' C to 6 5°-{ 
C. Renacurat ion can be accelerated by the addition of ■> 
polyethylene glycol \^ZZ^) cr salt. The salt concentration 
is preferably iron C rr:-: to 500 -y. , more preferably the salt 
c one e n t r a t i o n~ " is' f re m "l 0" t'o '1 C "0" r?y , " TheVsalt may be such 
salts as (NK 4 ) : S0 4 , KCl, or NaCi . The concentration of PEG is 
preferably from 0k to 2 0k , more preferably from 5V to 10%. 
The fragments that reanr.eai can be from different substrates 
as shown in Fig . 1 , pane 1 C . 

. . The annealed nucleic acid fragments are incubated 
in the presence of a nucleic acid polymerase, such as Taq or 
Klenow, and dNTP's (i.e. cL-.T? , cCT? , dGT? and dTTP). If 
regions of sequence identity are large, Taq or other high- 
temperature polymerase can be used with an annealing 
temperature of between 45-S5°C. If the areas of identity are 
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small, Klenow or other low- temperature polymerases can be used 
with an annealing temperature of between 20-30°C. The 
polymerase can be added to the random nucleic acid fragments 
prior to annealing, simultaneously with annealing or after 
annealing. 

The' cycle of denaturat ion , renaturation and 
incubation of random nucleic acid 'fragments in the presence of 
polymerase is sometimes referred to as "shuffling" of the 
.nucleic acid in vitro. This cycle is repeated for a desired 
number of times. Preferably the cycle is repeated from 2 to 
100 times, more preferably the sequence is repeated from 10 to 
40 times. The resulting nucleic acids are a family of double- 
stranded . polynucleotides of from about 50 bp to about 100 kb, 
preferably from 500 bp " to 50 kb, as shown in Fig. 1, panel D. 
The population represents variants of the starting substrates 
showing substantial sequence identity thereto but also^ 
diverging at several positions. The population has many more 
members than the starting substrates. The population cf 
fragments resulting from, recombination is preferably ."first " 
amplified by ?C?*, then cloned into an appropriate .vector and 
the 'ligation mixture used to transform host, cells'. 

In a variation cf in vitro shuffling, subsequences 
of recombination substrates can be generated by amplifying the 
.full-length sequences under conditions which produce a 
substantial fraction, typically at least 20 percent or' more, 
of ' incompletely extended amplification' products. . The 
amplification products, including the incompletely extended 
a~.pl i f icat ion, products are denatured and subjected to, at least 
one additional cycle' of reannealing and amplification. This 
variation, ' wherein at least one. cycle 1 of reannealing and ■ 
amplification provides a substantial fraction of incompletely 
extended products, is termed " stuttering . " In. the subsequent 
amplification round, the incompletely extended products anneal 
to and prime extension on different sequence - re la ted template 
species. 

In a further variation, at least one cycle of 
amplification can be conducted using a- collection of 
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overlapping single-stranded DNA fragments of related sequence, 
and different lengths. Each fragment can hybridize to and 
prime polynucleotide chain extension of a second fragment from 
the collection, thus forming sequence-recombined • 
polynucleotides. In a further variation, single -stranded. DNA 
fragments of variable length can be generated from a single 
primer by Vent DNA polymerase on a first DNA template. The" 
single stranded DNA fragments are used as primers for a 
second, Kunkel-type template, consisting of a 

uracil-containing circular single-stranded DNA. This results 
in multiple substitutions of. the first template into the 
second (see Levichkin et al . Mol . Biology 29:572-577 (1995)). 

Gene' clusters such as those involved in polyketide 
synthesis (or indeed any multi-enzyme pathways cacalyzinc 
15 analogous metabolic reactions) can be recombir.ed by recursive 
sequence recombination even if they lack DNA sequence a. 
homology. Homology car. be introduced using synthetic • 
oligonucleotides as PCS primers. In addition' to the soecific 
sequences for the gene being amplified, all of the primers 
used to amplify one type of enzyme (for example the acyl 
carrier protein in polyketide synthesis) are synthesized to •« 
contain an additional sequence of 20-40 bases 5' to the gene' 
(sequence A) and a" different 20-40 base sequence 3' to the ;> 
gene (sequence 3). The adjacent gene- (in this case the ketb-" 
synthase) is amplified using a 5' primer which contains the 
complementary strand of sequence 5 (sequence 3'), and a 3' 
. P-i^r 'containing „a different .2-3- -to- -base sequence (C) . 
Similarly, primers for the next adjacent gene (keto-' 
reductases) contain sequences C (complementary to C) and D. 
If 5 different polyketide gene clusters are being shuffled, 
all five acyl carrier proteins are flanked by sequences A and 
5 foliowin 9 their ?CR amplification. In this way, small 
regions of homology are introduced, making the gene clusters 
into site-specific recombination' cassettes . Subsequent to the 
initial amplification of individual genes, the amplified genes 
can then be mixed and subjected to primer less PC?.. Seauence B 
at the 3' end of all of the five acyl carrier protein genes . 
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can anneal with and prime DNA synthesis from sequence B' at ■ 
the 5' end of all five keto reductase genes. In this way all 
possible combinations of- genes within the cluster can be 
obtained. Oligonucleotides allow such recombinants ■ to be 
obtained in the absence of sufficient sequence homology for 
recursive sequence recombination described above , Only 
homology of function is required to produce functional gene 
clusters . 

This method is also useful for exploring 
permutations of any other multi -subunit enzymes. An example 
of .such enzymes composed of multiple polypeptides that have . 
'shown novel functions when the subunits are combined in novel 
ways are dioxygenases . Directed recombination between the 
four protein subunits of bipher.yl and toluene dioxygenases 
produced 'functional dioxygenases ' with increased activity 
against trichloroethyler.e (Furukawa et. al. J. Bacterid . 'I7S: 
2121-2123 (19S4)). This" combination of subunits from the two 
dioxygenases could also have been produced by cassette- 
shuffling of- the dioxygenases , as described above, followed by 
selection for degradation of ^trichloroethylene . 

■In some poiyket'ice synthases , ..the • separate 
functions of the' acyi carrier protein, keto - synthase , keto-. 
reductase, etc. reside in a single polypeptide. In these 
cases domains within the single polypeptide- may be shuffled, 
even if sufficient homology does not exist naturally, by 
introducing regions of homology as described above for entire- 
genes', in this case, it may not be possible to' introduce 
additional flanking sequences to the domains, due to the 
• constraint of maintaining ' a continuous open reading f rattle .■ 
Instead, grouos of oligonucleotides, are' synthesized that are- 
homologous to the' 3' end of the first domain encoded by 'one o 
the genes to be shuffled, and the 5' ends of the second ' 
domains encoded by ail of the ether genes to be shuffled 
together. This is repeated with all domains,, thus proving 

- ~ rp^p^h-'-acion between protein domains 

sequences t nat allow re^cj.^.^.i ^- r 

while maintaining their order. 
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The cassette-based recombination method can be ' 
combined with' recursive sequence recombination by including 
gene fragments (generated by DNase, physical shearing, DNA 
stuttering, etc.) for one or. more of the genes. Thus, in 
addition to different combinations of entire genes within a 
cluster (e.g., for polyketide synthesis) , individual, genes can 
be shuffled at. the same time, (e.g., all acyl carrier protein 
genes can also be provided as. fragmented DNA) , allowing a more 
thorough search of sequence space. 



( 2 ) In Vivo Formats 

(a) Plasmid- Plasmid Recombinarinn 
The initial substrates, for recombination are a 
collection of' •poly.nucleot ides comprising variant forms of a 
15 gene. The variant forms usually show substantial sequence 
• identity to each other sufficient - to allow homologous 
recombination between substrates. The diversity between th- T 
polynucleotides can be natural' (e.g.., allelic, or species A 
variants), induced (e.g., •:• .•:.-cr- prone ?CR or error -prone ' 
recursive sequence recombination}, or the result of* in vitro 
recombination. Diversity tan also result from resynthesizinc , 
genes encoding natural proteins with alternative codon usage. .'. 
There- should be at least sufficient diversity between " ^ 

substrates that recombination can generate more diverse '* 
products than there are starting mat-erials. There must be . at 
y aSZ £wo substrates differing in at least two- positions.. 
However , commo n-I-y- -a- ko^ry^of - s u bs rr^tre s" " of 7 TO* - 1 o*' "ie~ber s " i s 
employed. • The. degree of diversity depends on the length of ' 
the substrate being reccmbmed and the extent of the - 
functional change -.to- be evolved. Diversity at between 0.1-2^, 
o£ positions .is typical. The diverse substrates are 
incorporated into plasmids. Tine plasmids are often standard, 
cloning, vectors, e..g.,. bacterial mult icbpy plasmids .' However,, 
in some methods to b.e. d-escr: below, the plasmids include 
mobilization (MOB) functions.. The substrates can be 
incorporated into the same. or different plasmids,/ Often at 
least two different typ.es of piasmid having different types of 
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selectable markers are" used to allow selection for cells 
containing at <. least two types of vector. Also, where 
different types of plasmid are employed, the .different 
plasmids can come from two distinct incompatibility groups to 
allow stable co-existence of two different plasmids within the 
cell. Nevertheless; plasmids from the same incompatibility 
group, can still co-exist within the same cell for sufficient 
time ,to allow homologous recombination to occur. 

Plasmids containing diverse substrates are 
initially ' introduced into cells by any method (e.g., chemical 
transformation, natural competence, elect ropora t ion , 
biolistics, packaging into phage or viral systems) . Often, 
the plasmids are present at or near saturating concentration 
(with respect to maximum transfection capacity) to increase 
the probability of 'more than one. plasmid entering the same .. 
ceil.' The plasmids containing the various substrates can be 
transfected Simultaneously or in multiple rounds/ For' 
example^ in the latter approach' ceils can be transfected with. * 
a first aliquot of plasmid ,. trar.sf ectants selected and 
propagated,, and then, infected with a second aliquot of 
plasmid.. 

' Havinc ■ introduced - ' the plasmids into' ceils, 
recombination between substrates to generate recombinant genes 
occurs within cells containing multiple different plasmids 
merely bv prorogating the. ceils. However, cells that receive 
only one' plasmid are unable to participate in recombination 
and the potential contribution of substrates on such plasmids 
to evolution is not fully exploited. (although these plasmids 
may contribute to some extent if they, are progagated in 
mutator .cells) The rate ..of evolution can be increased by" t 
allowing all substrates to participate in recombination. Such 
can be achieved by subjecting transfected ceils to 
electroporation. The conditions for elect roporation are the 
same as those conventionally used for introducing exogenous 
DMA into cells (e.g., 1,000-2,500 volts, 4 00 piF and a 1-2 mM 
cap) . Under these conditions, plasmids are exchanged between 
cells allowing all substrates to participate in recombination. 
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In addition the products of recombination can undergo further 
rounds of recombination with each' other or with the original 
substrate. The rate of evolution can also be increased by use 
of conjugative transfer. To -exploit, conjugative transfer, 
substrates can be cloned into plasmids having MOB genes, and 
era genes are also provided in cis or in trans to the MOB* 
genes. The effect of conjugative transfer is very similar to 
electroporation in that it allows plasmids to move between 
cells and allows recombination between any substrate and the 
products of previous recombination to occur, merely by 
propagating the culture. The rate of evolution can also be 
increased by fusing cells to induce exchange of plasmids or 
chromosomes. Fusion can be induced by chemical agents, such 
as PEG, or viral proteins, such as influenza virus 
hemagglutinin, HSV-1 g3 and g D . The rate of evolution can 
also be increased by use of mutator host cells (e.g., Mut L, V 
S, D, T, H in bacteria and Ataxia telangiectasia human cell 
lines) . 

The time for which .cells are propagated and 
recombination is allowed to occur, of course, varies with the 
ceil type but is generally not critical, because -even a small, 
decree of recombination can substantially increase diversity 
relative to the starting materials. Cells bearing plasmids " 
containing reccmbir.ee genes are subject to screening or ^ 
~ — -£"-n .o. n u^.rt: .unction. rcr example, if the 
suostrate being evolved contains a drug resistance gene, one 
would -select -for drug!' resistance . Cells surviving "screening" 
or selection can be subjected to one or more rounds of ■ 
screening/selection followed by recombination or can be 
subjected directly to an additional round of recombination. 

The next round of recombination can be achieved by 
several different formats independently of the previous round. 
For example, a further round of recombination can be effected 
simply by resuming the electroporation or con j ugat ion -media ted 
intercellular transfer of plasmids described above, 
Alternatively, a fresh substrate or substrates, the same or 
different from previous substrates, can be transfected into 
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- cells surviving selection/screening. Optionally, the new 
substrates are included- in plasmid vectors bearing a different 
selective marker and/or from a different incompatibility group 
than the original plasmids... As a further alternative, cells 
surviving selection/screening can be subdivided into two 
subpopulations, and- plasmid DNA from one subpopulation 
transacted into the other, where the substrates from the 
'plasmids from the two subpopulations undergo a further round, 
of recombination. In' either of the latter two options, the 
rate' of. evolution' can be increased 'by employing DNA 
extraction, electroporation, .conjugation or mutator cells , as 
described above;. In 1 a ;still further variation, 'DNA from cells 
surviving screening'/select ion can be extracted and subjected 
to in vicro-, recursive sequence recombination. 

After the second round of recombination, a second 
round' of screening'/ select ion is performed, - preferably under . * 
conditions of * increased stringency. If desired-, further 
rounds ' of recombination- and se 1 ec: ion/screening - can be 
performed using the' same strategy as . for the second round: 
With/successive round S' of recombination and' ■ 
se lect ion/screeriinc, 1 the surviving recomoined .subs crates - 
evolve toward . accuis.it ion of 'a desired p he no type*. typically.,;' 
in this and other methods of recursive recombination, the 
f in'al oroduct of recorrJc i n = t i on ■ t ha t has ■acquired the desired 
'ohenotvno'e differs fror?.- starting, substrates at 0 . lV-25i? o: , 
positions and has evolved at a rate orders of 'magnitude in 
excess (e.g., bv 'at least 10- fold,' iOO-.folc, 1000 - fold, or- 
10,00.0 fold) of. the rate of naturally acquired mutation: of ■ 
about 1 mutation per' 10* 5 . posit ions per generation-, (see ( .-. 
'Anderson et \al . ' Pro:. Natl..' Acad . Sci> U.S.A. 93: 906-907 
(1996)) . 'The "final- product" may* be* transferred to 'another 
.host. more des irabie ■* f or utilization .of the "shuffled" DNA. 
This is particularly advantageous" in 's ituat ions where the more 
desirable ' host is less efficient as a host for the many cycles 
of mutation/recombination due to the lack of molecular biology 
or genetic tools available for other organisms such as £ . 
coli. 
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(b) Virus-Plasmid Rpr 0m binatinn 
The strategy used for plasmid-plasmid recombination 
can also be used for virus -plasmid recombination; usually 
phage-plasmid recombination. However, some additional 
comments particular to the use of viruses are aporopriate 
The initial substrates for recombination are cloned into both 
Plasmid and viral vectors. It is usually not critical which " 
suostrate(s, are inserted into the viral vector and which into 
Che .plasmxd, although usually .the -viral vector should contain 
oxfferent substrate (s) from the plasmid. As before, the 
plasmid (and the virus, typically contains a selective marke>- 
1,16 PlaSmld and Viral vs «°" « n both be introduced into 
cells by transection as described above. However, a more 
etricient procedure is r - a^c , , 

^° L * anSiec ^ the cells with olasmid 
select transf ectants and ^ro^ rr . ar _,= 

^ une cransrectancs with v'rus 

Because the efficiency of — f oc .,- 3n 0 c ns >^, . . ' V 

m - Cw.on o. many viruses aDoroaches 

100 » or ceils, cost c^»s v . , . . 

* --^.s.ec-c ana miected by this 

route, contain both a d'^-^ .-■ 

a p.a^..^ virus oearing different 

substrates. 

Homologous recombination • occurs between olasxid and" 
virus generating both recombinec oiasmids and recombined 

virus. ■ For some viruses q.<~~ = e - \ 

s — 4 as — lamentous phage., in whic* 
intracellular DN'A ^s^s - - ~„ h ^ . , " 

^ r° un "uoie-stranded and s <p G i 0 . : 

stranded forms, coth ca.-, participate in recombination ' 

Provided that the virus is not or, 



recombir.a t ^ -^n , ^ . "~ . / 

— . „«.. «-- = -e.-e= py use of eiectroooracion or--, 
corougafion to transfer piasmids between cells. Recombination 
can also be augmented for some types of virus by allowing eh . 

30 ;^° 3Sny . VirUS ™ to "infect other cells. For ^ 

-30 types or virus, virus .^?o- r *~ r ,iu 

^e.-ceils snow resistance to 

superinfection. Howeve- su-'- -oc-^^^ 

' ' -"-stance can be overcome by 

injecting, at hiah n"U^i ; ^M- ^ ^ • 7 7 
^ a -g. 4 n_t_p_^i_ ana/or using mutant strains of 

tne virus m which res^s^^--^ e,,^~~- c 

^-s _ Q superinfection is reduced. 

The result of - - n.— ~i a cm ; , ... 
ic ■ , -.--~^.g piasmia-contaanmg ce^s 

Jb Wlt .h virus deDends on tr- «.*rn^ n ~ • ■ 

o, the virus. Some viruses, 
sucn as filamentous Dhaq« crpniv evi,.,. • , 

n j ' suaol/ exist with a- plasmid in the 

ceil and also extrude oroc^u nbao~ fmm ^ u 

. — P nage r rom the cell. Other 
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"viruses, such as lambda having a cosmid genome, stably exist 
in a cell like plasmids without producing progeny virions. 
Other viruses, such as the T-phage and lytic lambda, undergo 
recombination with the plasmid but ultimately kill the host 
cell and destroy plasmid DNA . For viruses that infect cells 
without killing the host,, cells containing recombinant 
plasmids and virus can be screened/selected using the same 
approach as ' f or plasmid-plasmid recombination. Progeny virus 
extruded by cells surviving selection/screening can also be 
collected and used as substrates , in subsequent rounds of 
recombination. For viruses that kill their host cells, 
recombinant .genes. resulting from recombination reside only, in 
the progeny virus. . If the screening or selective assay 
requires expression of recombinant genes in a cell, the 
recombinant genes should be transferred from the progeny virus 
to another vector, e.g., a plasmid vector, and retransf ected 
into cells before selection/screening is performed. 

For filamentous phage, the products of 
recombination are present ir. coth ceils surviving 
recombination and in phage extruded' from these ceils. The 
dual source of recombinant products provides some additional 
options relative to the plasmid-plasmid recombination. For . 
example, DNA can be' isolated from phage particles for use in a- 

' round of in vitro recombination.". Alternatively, the progeny 
ohage can be used to trar.sfect or infect cells surviving a 
previous round of screening/select ion , , or fresh ceils . . 
transfected with fresh substrates, for recombination. 

( c ) Virus-Virus Recombination , 
The 'principles, -described' for plasmid-plasmid and 

, ... , i- i rnr- k= aoolied to virus-virus 

olasmic- viral recomPina^iO:. c^>. ~- 

recombination with a few modifications. Tne initial 
•substrates for recombination -are cloned into a viral vector. 
• Usually, the same vector is user! for all substrates. 

Preferably, the virus is cr.e that, naturally or as a result of 
mutation, does not kill cells. After insertion, some viral 
genomes can be packaged in vitro or using a packaging cell 
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line. The packaged viruses are used to infect cells at high .- 
multiplicity such that there is a high probability that a cell 
will receive multiple viruses bearing different substrates. 

After the initial round of infection, subsequent 
steps depend on the nature of infection as discussed in the 
previous section. For example, if the viruses have phagemid 
genomes such as lambda cosmids or M13, Fl or Fd phagemids, the 
phagemids behave as plasmids within the cell and undergo 
recombination simply by propagating the cells. Recombination' 
is particularly efficient between single-stranded forms of 
intracellular DMA. Recombination can be augmented by 
electroporation of cells. 

Following selection/screening, cosmids containing 
recombinant genes can be recovered from surviving cells, e.g., 
by heat induction of a cos' iysccenic host cell, or ' extract ion 
of DNA by standard procedures, followed. by repackaging cosmic 
DNA in vi tro . '., 

Ir tne viruses are filamentous phage, recombination 
,of replicating form DNA occurs by propagating the culture of" 
infected cells. Selection/screening identifies colonies of 
ceils containing viral vectors having recombinant genes with'-' 
improved properties, together with phage -extruded from such - 
cells. Subsequent options arevessentially the same as for '•' 
P i a s ~ i d - v i r a 1 r s c o m b i r. 2 " ■ ^ *■> ' ' ■ ■ 

' - ■ Chromosome Recombination 
mis tcrrr.at can be used to especially evolve 
chromosomal substrates. The format is particularly useful m 
situations in which many chromosomal genes contribute to a 
phenotype or one does not know the exact location of the 
chromosomal gene(s) to be evolved. The initial substrates for 
recombination are cloned into a plasmid vector. If the 
chromosomal gene(s) to be evolved are known, the substrates 
constitute a family of sequences showing a high degree of 
sequence -identity but some divergence from the chromosomal 
gene. If the chromosomal genes to be evolved have not been 
located, the initial substrates usually constitute a- library 
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•of DNA segments of which only a small number show sequence 
identity to the gene or gene(s) to be evolved. Divergence 
between plasmid-borne substrate and the chromosomal gene(s) 
can be induced by mutagenesis or by obtaining the plasmid- . 
borne substrates from a different species than, that of the 
cells bearing the ' chromosome . 

The plasmids bearing substrates for recombination 
are transfected into cells having ■ chromosomal gene(s) to be; 
evolved.- Evolution can occur simply by propagating the 
culture, and can be accelerated by transferring plasmids' 
between cells by conjugation or electroporatibn . Evolution 
can be further accelerated by use of mutator host cells or by 
seeding a culture of ncnmutator ' host cells being evolved with 
mutator host cells and inducing intercellular transfer of. 
plasmids by electroporat'ipn or conjugation. - Preferably, 
mutator host ceils used' for seeding contain a negative 
selectable marker 'to facilitate isolation of a pure culture of 
the nonmutat.or cells being evolved. .- Selection/screening. 
•• identif ies cells bearing chromosomes and/or plasmids ..that have 
20 evolved coward acquisition of a desired function. 

Subsequent rounds o: recombination and •■ 
selection/screening proceed in similar fashion to' _thc.se 



10 



lb 



•scribed for plasmid-plasmlc recombination: . For example, 
her r eco~jo in a t - c r. oar. be 



ected' bv orooacacing cells 



25 surviving recombinacicr, in combination with eleccroporaticn or 
ccnjugative transfer of plasmids.. Alternatively, plasmids 
• bearing additional substrates tor recombination can be 
introduced' into the surviving ceils. Preferably, such • 
plasmids are from a different incompatibility group and bear a 

30 different selective marker than the original plasmids- to allow 
selection for cells containing at least two different 
plasmids. As a further alternative, plasmid and/or 
chromosomal DNA can be isolated from a- subpopulat ion of 
surviving ceils and transfected into a second subpopulat ion . 

'35 Chromosomal DMA can be cloned into a plasmid vector before 
transf e'ction . 
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(e) Virus -Chromosome Recombination 
As in the other methods described above, the virus 
is usually one that does not kill the cells, and is often a ■ 
phage or phagemid. The procedure is substantially the same a 
for plasmid-chromoscme recombination . Substrates for 
recombination are cloned into the vector. Vectors including 
the substrates can then-be transfected into cells or in -vitro 
packaged and introduced into cells by infection. Viral 
genomes recombine with host chromosomes merely by propagating 
a culture. Evolution can be accelerated by allowing 
intercellular transfer of viral genomes by electropora t ion , or 
reinfection of cells by progeny virions. Screening/selection 
identifies cells having chromosomes and/or. viral genomes that 
have evolved toward acquisition of a desired function. - 

There are several options for subsequent rounds of. 
recombination. For example , viral genomes can be transferred' 
between ceils surviving select icn/recocnbinat ion by 
electropora tier. . Alternatively, viruses extruded from cells' 
surviving selecticr./s— eer.ing can be pooled and used to * t 

superinfect the cells at high multiplicity. Alternatively, 
fresh substrates for recombination can be introduced into the' 
cells, either on p 1 a s ~ i d or viral vectors. > 

I: * Re cursive Sequence Recombination Techniques for Metabolic 
and Cellular Engineering 

A • Starting Materials 

Tnus, a general method for recursive secuence 
recombination for the embodiments herein is to begin with a 
gene encoding an enzyme or enzyme subunit and to evolve that 
gene either for ability to act on a new substrate, or for. 
enhanced catalytic properties with an old substrate, either 
alone or in combination with other genes in a multistep 
pathway. The term "gene" is usee herein broadly to refer to 
.any segment or sequence of DNA associated- with a biological 
function. Genes, can be obtained from a variety of sources; 
including cloning from a source of interest or synthesizing 
from known or predicted sequence information, and may include 



WO 97/35966 



PCT/TJS97/04715 



r 22 

"sequences designed- to have desired parameters. The ability to 
use a new substrate can be assayed in some instances by the 
ability to grow on a substrate as a nutrient source. 'In other 
circumstances such ability can be assayed by decreased 
toxicity of a substrate for a host cell, hence allowing the 
host to grow in the presence of that substrate. Biosynthesis, 
of new compounds, such as antibiotics, can be. assayed 
similarly by growth of an indicator organism in the presence 
of the host expressing ' the evolved genes. 'For example, when 
an indicator organism used in an, overlay of the host 
expressing the evolved gene (s) , wherein the indicator organism 
is sensitive or expected to be sensitive to the desired 
antibiotic, growth of the indicator organism would be 
inhibited in a zone around the host cell or colony expressing 
the evolved gene(s) .' 

Another method of identifying new compounds is the 
use of standard analytical techniques such as mass 
spectroscopy, nuclear magnetic resonance, high performance 
liquid chromatography, etc. Reccmbinan t microorganisms can be 
pooled and extracts or media superna tants assayed from these 
pools. Any positive pool can then be subdivided and the 
procedure repeated until the single positive is identified 
( 11 s ib - se iec t ion " ) . 

In some,, instances, the start ing 'material for 
recursive sequence recombination is a discrete gene, cluster 
of Genes , or familv of genes known or thought to be associated 
with metabolism cf a part icular_ class of substrates. 

One of . the advantages of the instant invention, is' 
that structural 'information is not. required- to estimate which 
parts of a secuehce ■ should be mutated to < produce a functional 
hybrid enzyme . 

In some embodiments of the invention, an initial 
screening of enzyme activities in a particular assay can be 
useful in identifying candidate enzymes as starting materials. 
For' example, high throughput screening can be used to screen 
enzymes for dioxygenase - type activities using aromatic acids 
as substrates. Dioxygenases typically transform indoie-2- 
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carboxylate and indole"- 3 -carboxylate to colored products, 
including indigo (Eaton, et.. al.. J. Bacterid . 177:6983-6988 
(1995)). DNA encoding enzymes that give some activity in th: 
initial assay can then be recombined by the recursive 
techniques of the invention and rescreened. The use of such 
. initial screening for candidate enzymes against a desired 
target molecule or analog of the target molecule can be 
especially useful to generate enzymes that catalyze reactions 
of interest such as catabolism of man-made pollutants. 

This type of high throughput screening can also be 
used during each round of recursive sequence recombination to 
identify mutants that possess the highest level of the desire: 
activity. For example, penicillin G acylases have beer, 
isolated by looking for clor.es chat allow a leucine auxotrooh 
to hydrolyse penicillin. G analogue phenylacetyl-L- leucine, 
thereby producing leucine arc allowing cell growth (Martin, U. 
St al " ~ SMS ■'■'icrobiolocy 125:287 -252 (1995)). 

Positives from this seiect-r. are then screened by a more 
labour- intensive method for ability to hydrolyse penicillin .<& 

ibis same seiecticr. or phenyl acetyl -L - leucine car. • 

be used w'-»-i o-.-o ^ : •■ ~ , 

_.wi_._.. = n u acyiase tor greater 

activity by recursive sequence recombination.' After each 

round of ^"co-h ; n,ir ''o- *- i ; v- - ^. , « z 

- j , — _.e _ _ w . a r / or acyiase genes is 

c ran 3 r orrr.e z ir.co a 1 vr .~ t-^^« 

- — :.:Ooe L.ia u grow zastest 

' * ' :.?. .'ir.c tr.e rr.cst active acyiase. The 

--..-ases are .r.er. ce testec against the real substrate, 
penicillin G, by a -ore laborious screen such as HPLC. Thus, 
ever, if there is no convenient high throughput screen for an' 
enzyme or a metabolic pathway, is often possible to find a 
rapid detection; method that can approximately measure- the- 
desired pher.otype, thereby reducing the numbers of colonies . 
that must be. screened more accurately. 

.The starting material can also be a segment of such 
a gene or cluster that .is. rer.:,-,b- ned in isolation of its 
surrounding DNA, but is rel:.r.,ec to its surrounding DNA be for- 
screening/selection of recombination products. m other 
instances, the starting material for recombination is a larg _ 
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segment of DNA that includes a coding sequence or other locus 
associated with metabolism of a particular substrate at an 
unknown location. For example, the starting material can be a 
'chromosome, ep'isome, YAC, cosmid, or phage PI clone. In still 
other instances, Che starting material is the. whole genome of ■ 
an organism that" is known to have desirable metabolic 
properties, but for which no information localizing .the genes 
associated with these characteristics is available*. . 

In general any type of cells can be used -as a 
recipient of evolved genes . Ceils of particular interest 
include many bacterial cell types, both gram-negative and 
gram-positive, such as Rhodococcus , Streptomycetes , 
Accinomyceces , Corynebacteria, Penicillium t Bacillus, 
Escherichia-, coli , Pseudomonas , Salmonella , and Erwinia. Cells 
of interest also include eukaryotic cells,, particularly 
mammalian ceils (e.g.', mouse", hamster, primate, human,), both 
cell lines and primary, cultures. Such cells include, stem 
cells, including embryonic ste~ cells, zygotes, fibroblasts, 
lymphocytes, ' Chinese- hamster cvary (CKO) , mouse fibroblasts 

(NIK3T3), kidney, liver, muscle, and skin ceils. Other 
eukaryotic cells of interest include plant _ cells such as 
maize,, rice, wheat, cotton, soybean, sugarcane, tobacco, and 
arabidopsis; fish, algae, fungi (Penicillin, Fusarium, 
Aspergillus,- Pogcspora, ::euros P ora) , insects, yeasts (Picchia 

and Saccharcmyces) . ■ . 

The choice of host will depend 'or. a number of. 

factors, depending on the intended use of the 'engineered host, 

including, pathogenicity, substrate range, environmental 

hardiness, presence of key intermediates, ease 'of genetic 
■manipulation, and likelihood of promiscuous transfer of 

genetic information to other organisms.. Particularly . 

advantageous hosts are Z .. col:, laccobacill i , Streptomycetes, 

Actinomycetes and filamentous zungi. 

The breeding procedure starts with at least two 

substrates; which generally snow' substantial sequence identity 
to each other (i.e., at least about 50%', '70%, 80% or 90% 
seouence identity) but differ from each other at certain 
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positions. The difference can be any type of mutation; for ■ 
example, substitutions, insertions and deletions. Often, 
different segments differ from each other in perhaps 5-20 
positions. For recombination to generate increased diversity 
relative to the starting materials, the starting materials 
must differ from each other in. at least two nucleotide 
positions. That is, if there are only two substrates, there 
should be at least- two divergent positions. If there a-e 
three substrates, for example, one substrate can differ from 
the second as a single position, and the second can differ 
from the third at a different single position.' The starting 
DNA segments can be natural variants of. each other, for 
example, allelic or species variants." The segments can also " 
be from nonallelic genes showing SO me degree of structural ' and 
usually functional relatednesa (e.g., different genes within a 
supertanu.y such as the immunoglobulin superfamily) . The V 
.starting DNA segments can also be induced variants of each *. 
other. For example, one DI.V, segment can be produced bv e~o- 
prone , pgr replication of the other., or by substitution' of a » 
mutagenic cassette. Induced mutants can also be oreoared bv . 
propagating one (or both) ' of the segments in a' mutagenic ' 1; ' 

s -— s^c.o, s-.rictly speaking, the second i 
DNA segment is not a single segment but a large family of 

related segments. The different s ■=><-- °-r=; ^—-n-, 

^-3 cs ^or....ng the starting 

-aterials are often the same length cr substantially the. same' 
-er.gtr;. However, this need-' not be. the case,-, for examoie,- one 
-segmeht-clh-oe "i sCTec^ence'"o"f' a'nother , " The 'segments can" be' ~ 
present as part of larger molecules, such as vectors, or can 
be in isolated form. 

The starting .DNA segments are recombined bv anv of 
the recursive sequence recombination formats described above • 
to generate a diverse library of recombinant DNA segments..' ' 
Such a library can vary widely m size from having- fewer ' than 
10 to more than 10>, io/, or iC members.' In general, the 
starting segments and the recombinant libraries generated 
include. full-length coding sequences and any essential 
regulatory sequences, such as a promoter and polyadenylat ion 
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sequence, required for' expression . However, if this is not 
the case, the recombinant DNA segments in the library can be ■ 
inserted into a common vector providing the missing sequences 
before performing screening/selection. 

If the, recursive sequence recombination format 
employed is an in -vivo format, the library of recombinant DNA 
segments generated already exists in a .cell, which is usually 
the cell type in which expression of the enzyme with. altered 
substrate specif icity is desired. If recursive' sequence 
recombination is performed in vitro, the recombinant library 
is preferably introduced info the desired cell type before 
screening/selection. " The members of the recombinant library 
•can be linked to an episome or virus before introduction or 
can be introduced directly. In some embodiments of the 
invention,' the library is amplified, in a first host, -and is 
then recovered- from that, hose and introduced' to a second host, 
more amenable, to expression, selection, or screening, or any 
other desirable parameter. The manner in which the- library is 
introduced into the cell type depends on the DNA-uptake 
characteristics of the ceil type, e.g. , having viral 
receptors, being capable 'of conjugation, or 'being naturally 
corr.oe tent"'. If the cell tyoe is* insusceptible to natural and 
chemical - induced competence, but susceptible to 

electrcporaticn, one would .usually employ ,e iect ropora t ion . If 
the eel 1- type . i s insusceot ibie to e lectrcpora t ion as well, one 
can emplov bioiistics. The bioiistic PDS-1000 Gene Gun 
(Biorad, Hercules, CA) uses helium pressure to accelerate 
D::A-coated, gold or tungsten microcarriers toward target cells. 
The. process is applicable to a- wide range of tissues, 
including plants, .bacteria, fungi, algae, intact animal 
tissues, tissue culture cells, and animal embryos . One can 
employ electronic' pulse delivery, which is essentially a mild 
electroporation format for live tissues in animals and 
oatients. Zhao, Advanced Drue Delivery Reviews 17:257-262, 
(1995). Novel methods for making ceils competent are described 
in co-pending application U.S. Patent Application Serial No, 
08/621,430, filed March 25, 1995. After introduction of the 
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, library of recombinant" DNA genes, the cells are optionally 
propagated to allow expression of genes to occur. 

B - Selecti on and Screening 

Screening is, in general, a two-step process in 

which one first determines, which cells do and do not exoress a 

screening marker and then physically seoarates th= ceUs 

having the desired property. Selection is a form of sc^eenino 

m which identification and physical separation are- achWed " 

simultaneously, for example, by expression of a selectable 

marker, which, in some genetic circumstances, allows eel's 

expressing the marker to survive while other cells di- (o - 

V1 " Versa) • Screening markers include, for exar^e ' 

luciferase, tf-galactosidase , and green fluorescent orote- 

■ Screening can also be cone by observing such asoects of g-ow^ 

as colony size, halo formation, etc. Additionally, screen^" 

tor production of a desired compound, such as a therapeutic 

, ~ ' ' S1 = '- cr.er.icai can oe accomplished, bv observe, 

binding of c=>i 1 o-o^... - _ , . 

, . . P-o__ ,o , receptor or ligand, such as -on a 

SOllG SUDDOr: C ^ a r-Vs. ~. . ~- 

- ~ °" a COiL ' ^ ucn screening can additionally 

. o, -.n^-ng -o a,:.:iooaies ( as in an ELI SA r r 

sone instances the s:— ^ . ^ . 

..o.ess pre:eraoly automated 

S °, aS to ailcw screening of suitable numbers of colonies o- 

c --~3 . Some exa~c<^~ o..-, . ■ _ 

a -" c, "^ e: screening cevices inciud- 

fluorescence ar ; v^- : . , , 

, , ^-^. i=! , especially in conjunction 

cei_s ir.r,coiii.zed m agarose (see Pow^i o C a i 

-B ip/Te chnn^nv 8 : 333-337 • fi996) ) " Weaver ec T al" Sheds' 2-2^4-' ' 
247 (1991),, automated EL ISA assays, scinti ilaticTo^inU 
assa ys (Hart, H.E. et al . , Molecule 16:2S " 5 . 267 " 

(1979)) and the formation, of fluorescent, coloured c- UV 
aosorbing compounds on agar plates or in microtia w ,n s 
(KraWiSC ' S " ^^^dus^_K^^ (1 • 

Selectable markers can include, for examoie, drug tox<~ 
resistance, or nutrient syndesis genes. Selection is also 
done by such techniques as' growth on a toxic substrate to 
select for hosts having the ability to detoxify a substrata 

growtn on a new nutrient source to select for hosts havina th- 
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•ability to utilize that nutrient source, competitive growth in 

culture based on ability to utilize a nutrient source, etc. 

In particular, uncloned but differentially 

' exoressed proteins (e.g., those induced in response to new 

5 ' compounds, such as biodegradable pollutants in the medium) can 

' be screened by differential display (Appleyard et al . MoL. 

. Gen . Gent. 247:338-342 (1995)). Hopwood ( Phil Trans R. Soc . ■ 

Lond B 324:549-562) provides a review of screens for 

• antibiotic production. Omura (Microbio. Rev. S0:259-279 

10 ' (19.86) and Nisbet (Ann Med. Chem. 21:149-157 (1986)) 

disclose screens for ' antimicrobial agents, including 

supersensitive bacteria, detection of ^lactamase and D,D- 

" carboxypeptidase inhibition,- '-lactamase induction, 

chromogenic substrates and monoclonal- antibody screens. 

15 Antibiotic targets can also be used as screening targets in 

. high throughout screening. Antifungals are typically screened 

by inhibition of fungal grow::,. Pharmacological agents can be 

. --identified as enzvrne inhibitors, using plates containing the 

•enzyme and a chromogenxc sues, rate, or by automated receptor 

< r er ., v -=>3 =.c, Droteas.es, amylases) can 
20 assavs. Hyarciy-ic en- /...- = - . 

^»-n»^ hv iic'u-" the- substrate in an agar plate and 

. hvr-cW-~ ~-ar zone or by using a colorimetric 
scoring r or a n/c.c./-^ ~ j 

• _ <crw^* a< »^ ^ M^^obibl. 4-5:89-106 
indicator (Steele e — — 

— -oiec with the use of stains :o detect 



2d 



(1991) ).. This c 

the effects of enzvr- ,-:cn (such as congo red to detect tn 



, c Ao^r-^--^- — c-> Closes and hemicelluloses ) . . 

- n w fi -icon r or examole, lipases and 



i el 



acred substrates 



~ a r\ 



using different lengths of- fa 

... ' u 0 -m ; Tho action of lioases or 

acids Imkec to unoeli^e.v., dLL * u 



_ _ r _ _ :v 

esterases can be screens-: 



..^c - = ~ th* fattv'acid, resulting in a 
3G esterases removes - n i s -n.-- < 

u ^^ 0 ^ ;-Honife-v! 'fluorescence., These 

■ Quenching or enhancement 

^ - - „^^^;- P v- nUrps "by ' a robotic 

enzymes can be screenec :r. ^^ e - ?iac ■ y 

fluorescence a:::v-ed ceil sorting (FACS) methods 
3s ' a- also a powerful tool fcr selection/screening. In some 

instances a fluorescent molecule is made within a cell (e.g., 
g^n fluorescent protein) . The cells producing the protein 
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can simply be sorted by FACS . Gel microdrop technology allow 
screening of cells encapsulated in agarose micirodrops (Weaver 
et al. Methods 2:234-247 (1991)). In thia technique products 
secreted by the cell (such as antibodies or antigens) are 
immobilized with the cell that generated them. Sorting and 
collection of the drops containing, the desired product thus 
also collects the cells that made the' product, and provides a 
ready source for the cloning of the genes encoding the desirec 
functions. Desired products can be detected by incubating the 
encapsulated cells with fluorescent antibodies (Powell et al. 
Bio/Technology 8:333-337 (1990)). FACS sorting can also be 
used by this technique to assay resistance to toxic compounds 
and antibiotics by selecting droplets that contain multiple 
cells (i.e., the produce of continued division in the presence 
of a cytotoxic compound; Goguer. et al . Nature 363:189-190 
(1995)}. This method car. select for any enzyme that can £ l 
change the fluorescence of a substrate that can be immobilized 
in the agarose crcplet. 

In some embedimer. : s of the invention, screening can 
be accomplished by assaying reactivity with a reporter 
molecule- reactive with a desired feature of, for ■ example , a 
gene product. Thus, specific functionalities such as 
antigenic domains can he screened with antibodies specific for 
those determinants. ' . *'* 

In other embodiments of t he . invent ion , screening is 
preferably done with a' cell-cell indicator assay. In this 
-assay format/ separate ' library cells ( Cell." A,' the ceVi'~being 
assayed) and reporter ceils (Ceil 3 , the assay cell) are used. 
Only one component of the system, the library cells, is 
allowed to evolve. The screening is generally carried out in 
a two-dimensional immobilized format, such 'as on plates. The 
products of the metabolic . pathways encoded by these genes (in 
this case, usually secondary metabolites such as antibiotics, 
polyketides, carotenoics, etc . j . ci f fuse out of the library 
cell to the reporter cell, The product of the library cell 
-ay affect the reporter cell in one of a number of ways. 
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The assay system (indicator cell) can' have a simple 
readout (e.g., green fluorescent protein, luciferase, 0- 
galactosidase) which is induced by the library cell product 
but which does not affect the, library cell. In these examples 
the desired product can 'be detected by color imetric changes in 
■the reporter cells adjacent to the library cell. 

In other embodiments', indicator cells can in turn 
produce, something that modifies the growth rate of the library 
cells via a feedback mechanism. Growth rate feedback can 
detect and accumulate very small differences . For example, if 
the library and reporter cells are competing for nutrients, ' 
library cells producing compounds to inhibit the growth of the 
reporter cells will have more available nutrients, and thus 
will have more opportunity 'for growth. , This is a useful 
screen for antibiotics or a' library of poiyketide synthesis 
gene clusters where each of the library cells is expressing 
and exporting a different: poiyketide gene product.. 

Another variation cf this, theme is that the 
reporter ceil for an an:ib:o:;c selection can itself secrete a 
toxin or antibiotic that inhibits growth of the library- ceil, 
Production by the library ceil c: an antibiotic that is able 
to suppress growth of '.the reporter, cell will tnus allow 
uninhibited growth of the' library ceil. 

Conversely, if the library is 'being screened for 
product ion , cf 1 a compound that stimulates the growth of the 
reporter ceil (for example, in improving , -chemical syntheses, 
the library ceil may supply nutrients such as amino acids to 
an auxotrophic reporter, ,or 'growth, factors, to a growth- factor- 
dependent reporter. The reporter ' ceil in turn should produce 
a compound that' stimulates the' 1 growth of the library ceil. 
In.cerleukins / growth factors, and nutrients are possibilities. 

'Further possibilities include competition based on 
ability to kill surrounding cells, positive feedback loops in 
which the desired product made by the evolved ceil stimulates 
the. indicator cell to produce a positive growth factor for 
cell A, thus indirectly selecting- for increased product 
formation. 
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In some embodiments of che invention it can be 
advantageous to use a different organism (or genetic 
background) for screening than the one that will be used in 
the final product. For example, markers can be added to DNA 
constructs used for recursive sequence recombination to .make 
the microorganism dependent on the constructs during the 
improvement- process, even though those markers may be 
undesirable in the final recombinant microorganism. 

Likewise, in some embodiments it is advantageous to 
use a different substrate for screening an evolved enzyme than, 
the one that will be used in the final product. For example, 
Evnin et al. (Proc. Natl. Acad. s c i. U.S.A. 87:6659-6653 
(1990)) selected trypsin variants with altered substrate 
specificity by requiring that variant trypsin generate an 
essential amino acid for an argi-ine auxotroph by cleaving 
arginine 0-naphthylamide . This is thus a selection for *" 
arginine-specific trypsin, with the growth rate of the host ~ v ' 
being proportional to that c: the enzyrae activity. ■■■ 

The pool of ceils surviving screening and/or 
selection is enriched for recombinant genes conferring the 
desired phenotype (e.g. altered substrate specificity, altered'- 
biosyr.thetic ability, • etc. ) . Further enrichment can be 
obtained, if desired, by performing a second round of 
screening and/or selection without generating additional a? 
diversity. 

The recombinant gene or pool of such genes 
surviving one round of screening/selection forms one or more" ""' 
c: the substrates for a second round of recombination. Again, 
recombination can be performed ir. vivo or in. vitro by any of 
the recursive sequence recombination formats described above. 
If recursive sequence recombination is performed, in vitro, the 
recombinant gene or genes to form the" substrate for 
recombination, should be extracted from che cells in which • 
screening/selection- was performed. Optionally, a subsequence 
of. such gene or gene's can be excised for more targeted 
subsequent recombination. If the recombinant genets) are 
contained within episomes, their isolation presents no 
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"difficulties. If the recombinant genes are chromosomally 
integrated, they can- be isolated by amplification primed from 
known sequences flanking the regions, in which recombination 
has occurred. Alternatively, whole genomic DNA can be 
5 isolated, optionally amplified, and' used as the substrate for. 
recombination. Small samples of genomic DNA can be amplified 
by whole genome amplification with degenerate primers. (Barrett 
et NTn^i^,- Arfris Research 23:3488-3492 (1995)). These 

primers result in a large amount of random 3' ends, which can 
10 undergo, homologous recombination when reintroduced into cells. 

•If the second round of recombination is to be 
performed in vivo, as is often the case, it, can be' performed 
in the cell surviving screening/selection, or the recombinant 
genes can be transferred to another cell type (e.'g., a cell. 
15' type having a. high frequency of 'mutation and/or 

recombination). In this .situation, recombination 'can be 
effected by. introducing additional DNA segment (s) into cells 
bearing the recombinant genes. In other methods, the cells 
.can be induced to exchange generic information with each other 
20 by,- for example, electroporat ion . ' In some methods, the second 
round of recombination- is performed by dividing a pool cf 
cells surviving screening/selection in the first round into 
two subpopulations. DNA from one subpopulat ion is isolated 
and transfected into the • other population, where the 
25. recombinant, gene'(s) fro*, the two subpopulations recombine to 
form a further . library cf recombinant genes. In these 
methods, it is not necessary to isolate particular genes from 
. the, first subpopulat ion or to take steps to avoid random 

shearing of DNA during- extraction. ' Rather, 'the whole genome 
30 of DNA. sheared or otherwise cleaved into manageable sized 

fracments is transfected into the second subpopulat ion . This 
approach is particularly useful when several genes are being 
evolved simultaneously and/or the location and identity of 
such genes within chromosome are not known. 
35 • The second round of recombination is sometimes 

performed exclusively among the recombinant molecules 
surviving selection. However, in other embodiments, 
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additional substrates can be introduced. The additional 
substrates can be of the same form' as the substrates used in 
the first round of recombination, i.e., additional natural or 
induced mutants of the gene or cluster of genes, forming the 
substrates for the first round. Alternatively, the additional 
substrate (s) in the second round of recombination can be 
exactly the same as the substrate (s) in- the first round of 
replication. 

After the second round of recombination, 
recombinant 'genes conferring, the desired phenotype are again 
selected. The selection process proceeds essentially as 
before. If a suicide vector bearing a selective marker was 
used in the first round of selection, the same vector can be 
used again. Again, a cell or pool of cells surviving 
selection is selected. If a pool of ceils, the cells can be. 
subject Co further enrichment . «.£ 

1 1 1 • Recursive Seouencp ?^nrhi n3 tion of Genes For 

Bioremediat ion ■ > 

Modern industry Generates many pollutants for which 
the environment can no longer be considered an infinite sink'/ 
Naturally occurring microorganisms are able to metabolize 
thousands of organic cc~.pou.nds .' including many not found in v ' 
nature (e.g xencbio: ics ) . Bioremediation , the deliberate use' 
oz microorganisms for the biodegrada t ion of man-made wastes, 

. 1S e " ,er9i . n .? . tech ' z>lo ?y. 5 h . ac o"?rs _ cost and practicality 
advantages over traditional methods of disposal. The success 
of bioremediation depends on the availability of organisms 
that are able to detoxify or mineralize pollutants. 
Microorganisms capable of degrading specific pollutants can De- 
generated by genetic engineering and recursive sequence 
recombination. 

Although bioremediation is an aspect of pollution 
control, a more useful approach in the long tern is one of 
prevention before industrial waste is pumped into the 
environment.- Exposure o: industrial waste streams to 
recursive sequence recombinat ion -generated microorganisms 
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capable of degrading the pollutants they contain would result 
in detoxification of mineralization of these pollutants before 
the waste stream enters the environment. Issues of releasing 
recombinant organisms can be avoided by containing them within 
bioreactors fitted to the industrial ef fluent ' pipes . This J 
approach would also allow the microbial mixture used to be 
adjusted to best degrade' the particular wastes being produced,, 
.Finally, this method would avoid the problems of adapting to 
the outside world and. dealing with competition that face many 
laboratory microorganisms. 

In the wild, microorganisms have evolved new 
catabolic activities enabling them to exploit pollutants 
as nutrient sources for which there is no competition. 
However, pollutants that are 'present -at low 
concentrations ' in the environment may ,n,ot provide a 
sufficient advantage to stimulate the. evolution 1 of 
catabolic enzymes, -For a- review of' such naturally 
occurring evolution of bicdecradat ive. pathways and the"' 
manipulation of some of microorganisms by classical 
techniques, see Ramos et al,., Bio /.Technology 12:1349-1355 
(1.9.9.4) , ' ' 

Generation of new catabolic enzymes or pathways 
for bioremediat ion has thus relied upon deliberate, 
transfer of- specific genes between organisms (Vvackett et 
a!.. , supra) , forced ma tings between bacteria with 
specific catabolic capabilities (Brenner et. al. 
Bicdecradaticn 5:359-377 (1994)), or prolonged selection 
in a ''chemostat . Some researchers .have attempted t'o - ■ 

facilitate evolution via naturally occurring genetic / 
mechanisms in their chemostat selections by' including ' 
microorganisms with a variety , of- catabolic pathways 
(Kellogg et . al. Science 214.: 1133 - 1135 (1981); 
Chakrabarty American Society of Micro. 3iol. News 
62:130-137 (1995)). For a review of efforts in this 
area, see Cameron et al . Applied Bioc'nem. Biotech. 
38:105-140 (1993). 
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Current efforts in improving organisms for 
bioremediation take a labor-intensive approach in which 
many parameters are optimized independent ly,- including 
transcription efficiency from native and heterologous 
promoters, regulatory circuits and translat ional 
efficiency as well as improvement of protein stability 
and activity (Timmis et al. Ann. Rev. Microbiol . 48:525- 
527 (1994) ) . 

A recursive sequence recombination approach 
overcomes a number of limitations in the bioremediation 
capabilities of naturally occurring microorganisms. 
Both enzyme activity and specificity can be altered, 
simultaneously or sequentially, by the methods of the 
invention. For example, cataboiic enzymes can be evolved 
to increase the rate at which' they act on a substrate. 
Although knowledge of a rate - limi t ing step in a metabolic.;;', 
pathway is not required to praecipe the invention, rate- - 
limiting proteins in pathways car. be evolved to have 
increased expression and/or activity, the requirement for- 
inducing substances can be eliminated, and enzymes can be 
evolved that catalyze novel reactions. 

Scm.e examples cf chemical targets for . 
oicremediat icn include but are not limited to benzene, 
xylene, and toluene, camphor, naprthaiene, halog'e nated ■ • 
hydrocarbons, poiych lor ma ted biphenyis (?C3s), 
tnehiorethyiene , pesticides such as pentachiorophenyis 
A?C.?s) and herbicides -such- as -at rac i ne : ' ~ 

A . . Aromatic Hydrocarbons 

Preferably, when an enzyme is "evolved" to • 
have a new catalytic function, that function is 
expressed, either cons 1 i tut ively' or in response- to the 
new substrate. Recursive sequence recombination ■ sub j ects 
both structural and regulatory elements (including the 
structure of regulatory proteins) of a protein to 
recombinogenic mutagenesis simultaneously. Selection of 
mutants that are efficiently able to use the new 
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* substrate as a nutrient source will be sufficient to 
ensure that both the enzyme and its regulation are, 
optimized, without detailed analysis of either protein 
structure or operon "regulation. 

- Examples of aromatic hydrocarbons include but 
are not limited to benzene, xylene, toluene, biphenyl, 
•and polycycli.c aromatic hydrocarbons such as pyrene and 
naphthalene. These compounds are metabolized via 
catechol intermediates. Degradation of catechol by. 
Pseudomonas putida requires induction of the catabolic 
operon by cis, cis-muconate which acts, on the CatR 
regulatory protein. The binding site for the CatR 
protein is G-N u - A, while the optimal sequence for the 
LysR class of activators (of which ,'CatR is a member) is 
T-N^-A. Mutation of the G- to a T. in the, CatR binding 
site enhances the expression of catechol metabolizing . 
genes'. (Chakrabarty, American Society of Microbiology News 
62:130-137 (1996)). This demonstrates that the control 
■of existing catabolic pathways is not optimized for the 
metabolism of specific xenobiot ic.s . It is' also an 
example of a type of mutant that would be expected from ■ 
recursive sequence . recomb.inat ion of the operon followed .. 
by selection of bacteria that are better able to degrade 
the—target compound. ' 

.As an examole c: starting materials, 
dicxvqenases are retired for many pathways in which 
aromatic compounds ' are catabolized. Even small 
differences in dioxygenase sequence can lead 'to 
significant differences in substrate specif icity . 
(Furukawa e't al . J. Sac:. ' 17 5 -.5224- 52 32 I 1.9 9 3 ) ; .'Er ickson - 
or .1 Environ. Micro. 59:3858-3662 (1993)). A, 

hybrid enzyme made using sequences derived from two' 
"parental" enzymes may possess catalytic activities that 
are intermediate between the parents (Er ickson, ibid.),, 
or may actually be better than either parent for a 
specific reaction (Furukawa et ai . J^Mct. 176:2121-2123 
(1994)) . In one of these cases site directed mutagenesis 
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■ was used to generate "a single polypeptide with hyb-d 
sequence (Erickson, ibid.); in the other, a four subunit 
enzyme was produced by expressing two subunics from each 
of two different di oxygenases' (Furukawa, ibid.) Thus 
. sequences from one or more genes encoding dioxygenases ' 
can be used in the recursive sequence recombination 
techniques of the instant invention, to generate enzymes 
with -new specialties. m addition; ■ other features of 
the catabolic pathway can also be evolved using t „es- 
techniques, simultaneously or sequentially,., to ootimize 
the metabolic pathway for an activity of interest., 

B - Ha-ioaena r.ed Hy Hrn carbnn..; 

Large quantities of halogenated hydrocarbons 
are produced annually for uses as solvents and biocid- 
Tnese include, in the United States alone, over 5 nvl^ 
• -° nS ° f b ° th ^2-dichloro S tha.,e and vinyl chloride used 
; n PVC pr ° duCCion iR ch * alone. The comoounds ar- 

largely not biodegradable cv processes in single 
organisms, although in. principle haloaromatic catabo-c 
pathways can be constructed by combining genes from. 

cifrerent microorqa" -' s^ . . 

y-..-s„.s. -2y.,es can be manipulated to 

cnange tr.eir substrate specificities. Recursive s-ou-nc- 
recombination offers the possibility.of tailoring enzvm-" 
specitacity to new substrates without needing detailed, 
structural analysis of the enzymes. 
.. , .. A ^ an •e X ampTe of - p o S Tibli--s^^^ 

-or tne methods of the instant invention, Wackett et a' 
(Nature 363:627-629 (1994), recently demonstrated that^ 
tnrough classical techniques, a recombinant Pseudomonas 
strain in which seven qen^s on---;- ri'-,,^- 

oxygenases are combined, generated a! single host that can 
.etaooliz-e polvhalogenated compounds by secuential 
reductive and oxidative techniques to yield non-to^c 
products. These and/or related materials can b- 
suo 3 ected to the techniques discussed above so* as to' 
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: evolve and optimize a" biodegradative pathway in a single 
organism. 

Trichloroethylene is a significant groundwater 
contaminant. ■ It is degraded by .microorganisms- in a 
5 cometabolic way (i.e., no energy or nutrients are 

derived) . The enzyme must be induced by a different 
compound (e.g., Pseudomonas cepacia uses 
toluene-4 -monoxygenase, which requires ■ induction by 
toluene, to destroy trichloroethylene). Furthermore, the 

10 . degradation pathway involves formation of highly reactive . 
epoxides * that can inactivate the enzyme (Timmis et al . 
Ann . Rev . Microbiol . 48:525-557 (1994)). The recursive 
sequence recombination techniques of the invention could 
be used to mutate the enzyme and its regulatory region . 

15" ' such that it is* produced cons t i tut ively . and is less t ' ■ 

susceptible to epoxide inact : vation . In some ■ embodiment s 
of the invention, selection of hosts const i tut ively 
producing the enzyme 'and less susceptible, to the epoxides 
can be accomplished by demanding .growth in the presence 

20 of increasing concentrations o-f trichloroethylene in the 

absence of inducing substances. 

C. Polvchlorinated. Biohenvls (PCBs) and 

Polvcvclic Aromatic Hydrocarbons (PAHs) • • 

25 ' ?C3s and . ? AMs are families of structurally '■ 

related compounds that are major pollutants at many 
Suoer fund sites. Bacteria transformed with plasmids . 
•encoding enzymes with broader substrate specificity have 
been used commercially. In nature, no known pathways • nave 
30 been generated in a single host that degrade* the larger 

PAKs or more heavily chlorinated- ?C3s . Indeed,, often the 
collaboration of anaerobic a:\z aerobic bacteria are 
required for complete metabolism. 

Thus, likely sources for starting material for 
35 recursive sequence recombination include identified genes 

encoding PAK - degrading cataboiic pathways on large (20- 
100KB) plasmids (Sanseverino et al. A pplied , Environ . 
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Micro 59:1931-1937 (1993); Simon et al . Gene 127:31-37 

(1993) ; ZyLstra et al. Annals of the NY Acad. Sci . 
721:386-398 (1994)); while biphenyl and PCB-metabolizing 
enzymes are encoded by chromosomal gene clusters, and in 
a number of cases have been cloned onto plasmids (Hayase 
et al - J. BacterioT. 172:1160-1164 (1990); Furukawa et 
al. Gene 98:21-28 (1992); Hofer et al. Gene 144:9-16 

(1994) ). The materials can be subjected to the 
techniques discussed above so. as to evolve a 
biodegradative pathway in a single organism. 

Substrate specificity in the PCB pathway 
largely results from enzymes involved in initial 
dioxygenation reactions, and can be significantly altered 
by mutations in those enzymes (Erickson et al. Applied 
Environ, Micro 59:3858-38662 (1993); Furukawa et al. 
Bacj_ 175:5224-5232 (19S3). Mineralization of ?AHs and ,<£ 
?C3s requires that the downstream pathway is able to *l 
metabolize the products cf the initial reaction (Brenner;, 
et al - Biodearaca; ion 5:359 -377 (1594)). In this case/ 'v 
recursive sequence reco^b-a t :cr : of the entire pathway 
with selection for bacteria able to use the PCB -or PAH as, ■ 
tne sole carbon source will allow production of novel PCB 
a no PAH degradinc bacteria. 

D . Herbicides 

A general method fcr evolving genes for the' 
catabo-lis™ of- insoluble herbicides is " exemplified as " 
follows for atrazme. Atrazme [ 2 - chloro- 4 - ( e thylamino ) - 
S- (isopropylamino) -l f 3, 5-triazme] is a moderately 
persistent herbicide which is frequently detected in 
ground and surface water at concentrations exceeding the 
3 ppb health advisory level set by the EPA. Atrazme can 
be slowly metabolized by a Pseucomonas species 
(Mandelbaum et al. Appl . Environ . Micro . -61:1451-1457 
(1995)). The enzymes catalyzing the first two steps in 
atrazine metabolism by Pseucomonas are encoded by genes. 
AtzA and Atz3 (de Souza et al . Aool . Environ Mir>-n 
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61:3373-3378 (1995)).' These genes have been cloned in a 
6.8 kb fragment into pUCIS (Atz'AB-pUC) . ■ E. coli carrying 
this plasmid converts atrazine to much more soluble 
metabolites. It is thus possible to screen for enzyme 
activity by growing bacteria on- plates containing 
atrazine. The herbicide forms an opaque precipitate in 
the plates, but cells containing. AtzAB-pU18' secrete 
atrazine degrading enzymes, leading to/a clear halo 
around those cells or colonies. Typ ically, the size of 
the halo and the rate of its formation can be used to 
assess the level, of activity so that picking colonies 
with the largest halos allows selection of the more 
active or highly produced atrazine degrading enzymes. 
Thus, the plasmids carrying these genes can be subjected 
to the recursive sequence ' re-comb i nation formats described 
above -to optimize the cacabolism of atrazine in E. coli 
or another host of choice, including Pseudomdnas . After 
each round of recombina t icr. screening of host colonies 
expressing the evolved genes can be done on agar plates 
containing, atrazine to observe halo formation. This is a 
generally applicable method for screening enzymes 'that 
me t a bold ze insoluble compounds to .those that are . soluble 
/le.g. , po 1 ycyc lie arc-mac ic hydrocarbons)'. Additionally, 
catabolism of atrazine can provide a source of nitrogen 
for the, cell; if no other nitrogen is available, cell 
growth wi'i.l be limited by the rate at which the cells can 
catabclize- nitrogen. Cells able to. utilize atrazine as a 
nitrogen source can thus be selected from "a background of 
r^n - u t i 1 i z e r s or poor - ut i 1 i zers . 

"" ■' ' S.'. Heavy' Metal Detoxification ■ 

3acteria are used commercially to detoxify 
arsenate waste generated by the* mining- of arse.nopyri te 
gold ores. As well as mining effluent, industrial waste 
water is often contaminated with heavy metals (e.g., 
those used in the manufacture of electronic components 
and plastics) . Thus, simply to be able to perform other 
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bioremedial functions! microorganisms must be resistant 
to the levels of heavy metals present, including 
mercury, arsenate, chronate, cadmium, silver, etc. 

A strong selective pressure is the ability to 
metabolize a toxic compound to. one less toxic. Heavy 
metals are toxic largely by virtue of their ability to 
denature proteins (Ford et al. Bioextract i nn 
Biodeterioration of Merals , p. 1-23). Detoxification of 
heavy metal contamination can be effected in a number of 
ways including changing the solubility or 
bioavailability of the metal, changing its redox state 
(e.g. toxic mercuric chloride is detoxified by reduction 
to the much more volatile elemental mercury) and even by 
bioaccumulation of the metal by immobilized bacteria or 
plants. The accumulation of metals to a sufficiently 
high concentration allows metal to be recycled; smelting 
burns off the organic par- of the organise, leaving 
behind reusable accumulated rr.etai. Resistances to a 
number of heavy- metals (arsenate, cadmium, cobalt, 
chromium, copper, mercury, r.ickei, lead, silver, and 
zinc) are plasmic encoded in a .-.umber of species 
including Staphylococcus and Pseucomonas. (Silver et al. 
Environ. Health Persoec-. 102:107-113 (1994),- Ji et al. 
J. Inc. Micro I-1.-S1-75 (1995)) . These genes also confer 
heavy metal resistance or. other species as well (e.g., 
coii) . The recursive sequence recombination techniques 
of the- instant- invention- (RSR) can be" used to increase " 
microbial heavy metal tolerances, as well as to increase 
the excent to which "lis will accumulate heavy metals. 
For example, the ability of E . coli, to detoxify arsenate 
can be improved at least 100-fold by RSR (see co-pending 
Application Serial No. 03/621 ,-355'. filed March 25, 1996). 

Cyanide is very efficiently used to extract 
gold from rock containing as little as 0.2 02 per ton. 
This' cyanide can be micrcbiaiiy neutralized and used as a 
nitrogen source by fungi or bacteria such as Pseudomonas 
fluoresceins.- A problem with microbial cyanide 
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degradation is the presence of toxic heavy metals in the' 
leachate. RSR can be used to' increase the resistance of 
bioremedial microorganisms to toxic heavy metals, so that 
they will be able to survive the levels present in many 
industrial and Superfund sites. This will allow' them to 
biodegrade organic pollutants including but not limited 
to aromatic hydrocarbons/ halogenated ' hydrocarbons , and 
biocides . 1 . 

' F . Microbial Mining ■ 

"Bioleaching" is the' process by which microbes 
convert insoluble metal' deposits, (usually metal sulfides 
or oxides) into soluble > metal sulfates. Bioleaching is 
commercially, important in. the mining of arsenopyri te , but 
has additional potential in. the detoxification and 
recovery of metals, and acids from waste dumps. Naturally 
occurring bacteria capable of bioleaching are reviewed by 
Rawlings and Silver ( Bio/Technology 13:773 -778 (1995)). 
These 'bacteria are typically divided into groups by their' 
preferred temperatures for growth. The more important 
mesophiles are Thiobacillus- and' Leptospi ri 1 Ivjd species. 
Moderate thermoohiles include Sclfobacill us species. 
Extreme t he rmophi 1 e s include Sul folobus species. Many of. 
these organisms are difficult to' grow in commercial 
industrial settings, making their cataboiic abilities 
attractive candidates for transfer to and optimization in 
other organisms such- as Pseudorr<.orias , Rhodococcu's T . , 
ferrooxidans or E . coli. Genetic systems are available 
for at least one strain of t . f errooxidans , allowing the 
manipulation of its .genetic material on plasmids : . 

The recursive secuence recombination methods 
described above can be used to optimize the catalytic 
abilities in native hosts or heterologous hosts for 
evolved bioleaching genes or pathways, such as the 
a'bility to convert metals from insoluble to soluble 
salts. .In addition, leach rates of particular ores can 
be improved as a result of, for example, increased 
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resistance to toxic compounds in the ore concentrate, 
increased specificity for certain substrates, ability to 
use different substrates as nutrient sources, and so 



on. 



G - Oil Desulfurizaf inn 

The presence of sulfur in fossil fuels has been 
correlated with corrosion of pipelines, pumping, and 
refining equipment, and with the premature breakdown of ' 
combustion engines. Sulfur also poisons many catalysts 
used in the refining of fossil fuels. The atmospheric 
emission of sulfur combustion products is known as acid 



rain 



Microbial dssulf urization is an aopealing 
bioremediation application. Several bacteria have been 
reported that are capable of catabolizing 
dibenzothiophene (D3T) , which is the representative 
compound of. the class of s,lf,r compounds found in f OM <i 
fuels.- U.S. Patent No. S.l-SS.SCl discloses the clor.inc 

o: a DNA ~o'»ci]l= fT>-> 

U C '°- •-•-'---•-•---s rnoaocnrous caoable of 

oiocatalyzir.g the cesulf u-rization of oil. Denome et al 
(Gene 175:6890-6901 (1955;.; disclose the cloning of a 
S.S fcb DNA fragment fron Fse^o.onas encodir.a the uooe>- 
r.apr.thalene catabolizing pathway which also degrades 
cibenzothiophene. Other gene's, nave been identified that 
?er: ° r ~ ?i ~j lsr fun "- i:r - s *--sclosed in U.S. 5,355,601) . 

• ?ne a = clvlty cf • chese sr.zyrr.es is currently too 

cone . .^able, o-ju cne pathway could be 

increased in efficiency using the recursive .sequence ■ 
recombination techniques of the invention. The desired 
property of the genes of interest is their ability to . 
aesulfurize dibenzothiophene or its alkyl or aryl 
substituted analogues. In son- embodiments of the 
invention, selection is preferably accomoiished by 
coupling .this pathway to one providing, a nutrient to t^ 
bacteria. Thus, for example, dasulf urization of 
dibenzothiophene results i., formation of hydroxybiohenyi 
ims is a substrate for the oipnenyl -catabolizing oathway 
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which provides carbon and energy. Selection would. thus 
be done by "shuffling" the dibenzothioph'ene genes and 
transforming them into a host' containing the biphenyl- 
catabolizing pathway. Increased dibenzothiophene 
desulf urizat ion will result in increased nutrient 
availability and increased growth rate. Once the genes 
have been evolved they are easily separated from the 
biphenyl degrading genes. The latter are undesirable in 
the final product since the object is to desulfurize 
without decreasing the energy content of the oil. Alkyl 
or aryl substituted dibenzot hiophenes can be detected by 
changes in fluorescence (Krawiec, S., Devel . Indus . 
Microbiology 31:1.03 - 114, (1990)) or by detection of phenol 
groups formed as a result of desul f urizat ion (Dacre/ J.C. 
Anal .' Cher?.. 43:589 - 591 (1971)}. - 

H ■ Organo-Nitro Compounds 

Organo-nitro compounds are. used as explosives, 
dyes, drugs,, polymers and ar. t imicr-obial agents. 
Biodegrada t ion, of these ccrr ; pcur.ds occurs usually by wa-.y 
of reduce ion of the nitrate, group, catalyzed by 
.nitroreductases/' a, family of broadly - speci f ic enzymes.- 
Partial reduction of orcano- r.i t ro compounds often results 
in' the -formation of a compound more toxic than- the 
original (Hassan e: ai. 1 979 Arch Bioch - 3100 . 196:385- 
,395) Recursive sequence . recorjDination of 
nitroreductases can produce enzyme s that are. more, 
specific, and able to more completely reduce (and thus 
detoxify) their target compounds (examples of which 
include but are not ■ limited to ni t rotoiuenes ■ and 
nitrobenzenes) . Nitro- reductases . can be isolated from 
bacteria isolated f rom expios ive - contaminated soils', such 
as Morgana 11 a morgan i i and Er.zerobacter cloacae {Bryant 
e t al., 1991. J. Biol Cher, 266:4126-4130). A preferred 
selection method is to look for increased resistance to 
the organo-nitro compound of interest, since that will * 
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indicate that the enzyme is also able to reduce any toxic 
partial reduction products of the original compound. 

Iv - Use of Alternati ve Substrates for Chemical Synthesis 

Metabolic engineering can be used to alter 
microorganisms that produce industrially useful 
chemicals, so that they will grow using alternate and 
more abundant sources of nutrients, including human- 
produced industrial wastes. This typically involves 
providing both a transport system to get the alternative 
substrate into'- the engineered cells and catabolic enzymes 
from the natural host organisms to the engineered cells. 
In some instances, enzymes can be secreted into the 
medium by engineered cells to degrade the alternate 
substrate into a form that can more readily be taken uo 
by the engineered cells; in other instances, a batch of 
engineered cells can be crown cn one preferred substrate, 
then lysed to liberate hydrolytic en2ym.es for the 
alternate substrate into the medium, while a second 
inoculum of the same engineered host or a second host is 
added to utilize the hvdrolvzat ^ 

The starting materials for recursive sequence 
recombination will typically be genes for utilization of 
a substrate or its transport. Examples of nutrient 
sources 'of interest include but are not limited to 
lactose, whey, galactose, mar.nitol, ;<ylan, cellobiose r 
'cellulose and sucrose, thus allowing cheaper" product ion 
of ..compounds including but not limited to ethanol, 
tryptophan, rhamnolipid surfactants, xanthan gum, and 
poiyhycroxylalkanoate . For a review of such substrates 
as desired target substances, see Cameron et al . ( AppI . 
Biochem. Biotechnol . 38:10-5-140 (1993)). 

The recursive sequence recombination methods 
described above can be used to optimize the ability of 
native hosts or heterologous hosts to utilize a .substrate 
of interest, to evolve more efficient transport systems, 
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. to increase or alter specificity for certain substrates, 
and so on, 

. V. Biosynthesis 
5 Metabolic engineering can be used to alter 

organisms, to optimize the; production of practically any 
metabolic intermediate, including antibiotics, vitamins, 
amino acids such, as phenylalanine and aromatic amino 
acids, ethanol, butanol, polymers such as xanthan gum and 
10 bacterial cellulose, peptides, and lipids. When such 

compounds are already produced by a host, the recursive 
sequence recombination techniques described above can be 
.used to optimize production of the desired metabolic 
intermediate, including such features as increasing 
15 ■" ' enzyme substrate specificity and turnover number 

altering metabolic fluxes to" reduce the concentrations of 
■ toxic substrates or intermediates, increasing resistance 
- ; of the host to such toxic compounds,, eliminating, 
reducing or altering the need for inducers of gene 
20 expression/activity, , increasing the production of enzym.es 

essary for metabolism, etc. 

'. Enzvmes can also be evolved for improved 
tivity in solvents other than water. This- is useful 
because intermediates in chemical syntheses are often 
25 . protected by blocking groups which dramatically affect 

the solubility of the compound in aqueous solvents. Many 
compounds can be produced by a combination of pure , , 
chemical and enzymicai iy catalyzed reactions. .'Performing 
: enzymic. reactions oh almost insoluble substrates is 
> 30 ' clearly very inefficient,^ the, availability of enzymes, 
that are active in other solvents will be of great- use. 
One example of such a scheme is the evolution of a para- 
nitrobenzyl esterase to remove protecting groups from an 
intermediate in ioracarbef synthesis (Moore, J.C. and 
35 Arnold, F.K. Mature Biotechnology 14:458-467 (1996)). 

In this case alternating rounds of error-prone PCR and 
colony screening for production of a fluorescent reporter 
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from a substrate analogue were used to generate a mutant 
esterase that was .16 -fold more active than the parent 
molecule in 30% dimethylf ormamide . No., individual 
mutation was found to contribute more than a 2- fold 
increase" in activity, but- it was the combination of a. 
number of mutations which led to' the overall increase. 
Structural, analysis of the mutant protein showed that the 
amino acid' changes were distributed throughout the length 
of the protein in a. manner that *could not have been 
rationally predicced. Sequential rounds of error-prone 
PGR have the problem that after each round all but one 
mutant is discarded, with a concomitant loss of 
information contained in all the. other beneficial 
mutations. Recursive sequence recombination avoids this 
problem, and would thus be ideally suited to evolving 
enzymes for catalysis in other solvents, as well as in 
conditions where - sal t . concent rat ions or pH were- different 
from the original enzyme cptimas. ' 

In addition, the yield of almost any metabolic 
pathway can be increased, whether consisting entirely of • 
genes endogenous to the host organisms or all or partly 
heterologous genes, . Optimization of the express ion ■ 
levels of the enzymes in a_ oathway is more complex than-' 
simply maximizing expression. . In " some, cases regulation, 
ratner- than, constitutive expression of-.an enzyme* may be 
advantageous for cell growth and therefore for product 
■yield; ~a.s Tre"en-Tbr"prb^ '* 
et ai * Ann.. NT Acad. Sci. 589:T£-?4 (1930)) and 2-keto-L- 
. gluconic acid (Anderson et al.'U.S. 5,032,514)., In 
'addition, it is often advantageous for industrial', 
purposes to express proteins in organisms other than 
their original hosts. New host strains may be' preferable " 
for a variety of reasons, including ease of -cloning and 
transformation,; pathogenicity,, ability to survive in 
'particular environments and a knowledge of -the ' physiology 
and genetics of the organisms'. However,- proteins 
expressed in -heterologous organisms often show markedly 
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reduced activity for a variety of reasons including 
inability to fold properly in the new host (Sarthy et al. 
Appl. Environ. Micro. 53:1996-2000 (1987)). Such 
difficulties—can indeed be overcome by the recursive 
sequence recombination strategies, of the instant 
invention.. 

A '. Antibiotics 

The range of natural small molecule- antibiotics 
includes but' is not limited to peptides, pept idolactones , 
thiopept ides , beta-lactams, glycopept ides / lant ibiotics , ■ 
microcins, polyket ide-derived antibiotics (anthracycl ins , 
t etracyclins , macrolides, avermectins, polyethers and '* 
ansamycins) , chloramphenicol, aminoglycosides,' 
aminocycl i tols , polyoxins, agrocins and isoprenoids. 

There, are' at least three ways ' in - which 
recursive sequence rec omb ;r,a: : or. techniques of the 
instant' invention can be used :o, facilitate novel drug' 
synthesis, or to improve .biosynthesis of existing 
ant ibiot ics . ' . 

First, antibiotic synthes.is enzymes can be 
"evolved" together with transport systems that allow 
entry -of compounds used as antibiotic precursors to 
improve uptake and incorporation of function-altering 
artificial sice chain precurscrs. For example, 
oenicillin V i,s produced by feeding Penicilliusn the' 
artificial side chain precursor phenoxyacet ic acid, and 
LY 14 6 0 3 2 bv feeding Streptomyces roseosporus decanoic 
acid (Hopwood, Phil. Trans. R . Soc . Lond . 3 324:549-562 ' 
(1989) )\ Poor precursor uptake and poor incorporation by 
the . s yn t he s iz i ng en: ym e often lead to inefficient 
formation of. the desired .product . Recursive sequence 
recombination of these two cyctems can' increase the yield 
of desired product . 

Furthermore, a combinatorial approach can be' 
taken in which an enzyme is snuffled for novel catalytic 
-activity/substrate recognition (perhaps by including 
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randomizing oligonucleotides in key positions such as the 
active site) A number of different substrates (for 
example, analogues of side chains that are normally 
incorporated into the antibiotic) can then be tested in 
combination with all the different enzymes and tested for 
biological activity. In this embodiment, plates are made 
containing different potential antibiotic precursors 
(such as the side chain analogues) . The microorganisms 
containing the shuffled library (the library strain) are ' 
replicated onto those .plates , together with a competing, 
antibiotic sensitive, microorganism (the indicator 
strain) J Library cells that are able to incorporate the 
new side chain to produce an effective antibiotic will 
thus be able to compete with the indicator strain, and 
will be selected for. 

Second, the expression of heterologous genes 
transferred from one antibiotic synthesizing organism to 
another can be optimized. The newly introduced enzyme (s) 
act on secondary metabolites in the host cell, 
transforming them into new compounds with novel 
properties. Using traditional methods, introduction of 
foreign genes into antibiotic synthesizing hosts has 
already resulted m the production of novel hybrid 
antibiotics. Examples include meder rhcdin , 
ci nycrog r anat i rhodi n , 5 - deoxye rv t hromyc in A, 
isovaleryispiramycir. and other Hybrid mac rol ides (Cameron 
- rv a *"- Aobl . 3ibchem Siotechhol " '38*: 105-140 "('19 93 )) 
The recursive sequence recombination techniques of the 
instant invent ion- can be used to optimize expression of 
the foreign genes, to stabilize the enzyme in the new 
host cell'-, and to increase the activity of the introduced 
enzyme against its new substrates in the new host cell. 
In some embodiments of the invention, the host genome may 
also be so optimized. 

Third, the substrate specif icicy of an enzyme 
involved in secondary metabolism can be altered so that ■ 
it will act on and modify a new compound or so that its 
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activity is changed and it acts at a different subset of 
positions of its normal substrate. Recursive sequence 
recombination can be used to alter the substrate 
specificities of enzymes. Furthermore, in addition to 
recursive sequence recombination of; individual enzymes 
being a strategy to generate novel antibiotics, recursive 
sequence recombination of entire pathways, by altering 
enzyme ratios, will alter • metabolite fluxes and may 
result, not only in increased antibiotic synthesis, but 
also in the synthesis of different antibiotics . This can 
be deduced from the observation that expression of 
different genes from the same cluster in a foreign host 
leads to different product s " being formed (see p. 80 in 
Hutchinson et. al . , .(1991) Ann NY Acad Sci , 646:78-93). 
Recursive sequence recombination of -the introduced, gene 
clusters may result in a variety or expression levels of 
different proteins within. the cluster (because it 
produces different combinat ibns' of f in this case 
regulatory, mutations)'. This in .turn may lead to a 
varietv of different end products. .Thus, "evolution" of 
an existing antibiotic synthesizing pathway could be used 
to generate novel antibiotics, either by modifying the 
rates or .substrate specificities of enzymes in that 
pathv/ay. 

Addit io.nallv, antibiotics can also be produced 
in vicr.o by the action of a purified enzyme on a 
orecurso'r. For example i sopenici i I in N synthase . * 
catalyses the cyclization of many analogues .of its normal 
substrate' (d- ( L - a - arainoad ipyl ) - L- cysteinyl- D -valine) 
(Hutchinson , Med.' Res . Rev . 8 : 557 - 567 * ( i 93 8 ) ) . Many of 
these products are active as antibiotics. -A wide 'variety 
of substrate analogues can be tested for incorporation by 
secondary metabolite synthesizing enzyrr.es without concern 
for the initial 'efficiency c: the reaction. 'Recursive 
sequence recombination can be used subsequently to 
increase the rate of reaction with a promising new 
substrate . 
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Thus, organisms already producing 'a desired 
antibiotic can be evolved with the recursive .sequence 
recombination techniques described above to maximize 
production of that antibiotic. Additionally, new 
antibiotics can be evolved by manipulation of genetic 
material from the host by the recursive sequence 
recombination techniques described above. Genes for 
antibiotic production can be transferred to a preferred 
host after cycles of recursive sequence recombination or 
can be evolved in the preferred host as described above. 
Antibiotic genes are generally clustered and are often 
positively regulated, making them especially attractive 
candidates for the recursive sequence, recombination 
techniques of the instar.t invention. Additionally, some 
genes of related pathways show cress -hybridization, 
making then preferred candidates for the generation of 
new pathways for new antibiotics by the' recursive 
sequence recombination- techniques of the invention. 
Furthermore, increases ir. secondary metabolite production 
including enhancement of substrate fluxes (by increasing 
the rate of a rate limiting en.z-.~-, deregulation of the ■ 
pathway by suppression of negative control elements or - . 
ever expression of activators .and the relief of feedback 
controls by mutation of the regulated ' enzyme to a 
feedback- insensitive deregulated protein) can be achieved 
oy recursive sequence recombination without exhaustive 
analysis of "the regulatory mechanisms governing 
expression of the relevant cene clusters. 

The nost chosen for expression of evolved genes 
is preferably resistant to the ant ibiot ic .produced , . 
although in some instances • product ion methods can be 
designed so as to sacrifice nost ceils when the amount of 
antibiotic produced is commercial ly .significant yet 
lethal to the host. Similarly., bioreactors can be 
designed so that the growth medium is continually 
replenished, thereby "drawing off" antibiotic produced 
and sparing the lives of the producing cells. 
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Preferably, ■ the mechanism of resistance is not the 
degradation* of the antibiotic produced. 

Numerous screening methods for increased 
antibiotic expression are known in the. art, as discussed 
5 . above, including screening for organisms that are more 

resistant to the antibiotic that they produce.. This may 
result from linkage between expression of the antibiotic 
synthesis and antibiotic resistance genes (Chater, 
Bio/Technology 8 : 115-121 . (19.90)-) . Another screening 
10 method is to .fuse a reporter gene (e.g. xylE' from the 
Pseudomonas TOL plasmid) to the antibiotic production 

■ genes. Antibiotic . synthesis gene expression can then be 
measured' by looking for expression of the reporter (e.g. 
xylE encodes a catechol dioxygenase which produces yellow, 

15' muconic semia ldehyde 1 when colonies are sprayed with 

catechol .' (Zukowski e: al. Proc ... Natl. Acad. Sci. U.S.A. 
80:1101-1105 (1993)). 

, The wide variety, zi cloned antibiotic genes 
provides a wealth of starting materials for the recursive 

20 sequence recombination techniques of the instant 

invention. For, example, genes have been cloned from 
S creptorryces cattleya which direct cephamycin C .synthesis 
in the nor. - ant ibict i c producer Screpccmyces 1 iv'idans 
(Chen et al . 3.io /Techno i osv 6:1222-1224 (198-8)). 

2'5 Clustered genes f or ■ penic 11 1 in biosynthesis 

id- ( L-a-am'inoadipyl) - L - cyste inyl - D -valine synthetase; 
isopenicil 11 n N T - synthetase' and a cy 1 ■ coenzyme 
A: 6 -aminopenicillanic acid a.cyltrans f erase ) have been 
cloned from Penic i 1 1 i urn chrysogenum . Transfer of these 

30 genes into Heurospora crassa and Aspergillus' niger result 

in the synthesis of active penicillin V (Smith et al. 
Bio/Technoloov 8:39-41 (1930)). For a review of cloned 
genes involved in- Cephalosporin C, Penicillins G and V 
and Cephamycin C biosynthesis, see Piepersberg , Cr it . - 

35 Rev . Biotechnol . 14:251-285 (1994) . For a review of 

cloned clusters of antibiotic - producing genes, see Chater 

■ 3io/Technolocv 8:115-121 (1990). Other examples of 



WO 97/35966 



POYUS97/04715 



10 



53 

antibiotic synthesis genes transferred to industrial 
producing strains, or over expression of genes, include 
tylosin, cephamycin O, cephalosporin C, LL-E33288 complex 
(an antitumor and antibacterial agent) , doxorubicin, 
spiramycin and other macrolide antibiotics, reviewed in 
Cameron et al. AddI. Bi ochem. Biotechnol. 38:105-140 
(1993). 

B - Biosynthesis to Replace rhemical Synthesis 

of Antibiotics 
Some antibiotics are currently made by chemical 
modifications of biologically produced starting 
compounds. Complete biosynthesis of the desired 
molecules may currently be impractical because of the 
15. lack of an enzyme with the required enzymatic activity 
and substrate specificity. For example, 
7-aminodeacetcoxycephalosporar.ic acid (7-ADCA) is a 

■ precursor for semi -synthet i cai iy produced cephalosporins. 
7-ADCA is made by a chemical ring expansion from ' 

20 penicillin v followed uy enzymatic ceacylation of the 

phenoxyacetai group. Cephalosporins could in. pri.nc.iole 
be produced biologically fro- their corresponding 
penicillins (e.g., cephalosporin V or G from penicillin V 
or G) using penicillin !•' expandase, but other penicillins 

25 (S - Ch as Penicillin v cr G ) are not used as substrates by 
:---own expancases. The recursive sequence recombination 
techniques -of- the- invent i"5n can "be "used" to' alter the" ' 

■ enzyme so that it will use penicillin V as a substrate. 
Similarly, penicillin transacvlase could be so modified 



3 0 



3 5 



or. 



to accept cephalosporins or cephamycins as substrates. 

In yet another example, penicillin amidase 
expressed in E. call is a key enzyme in- the production 
penicillin G derivatives. 7,c- enzyme is generated from a 
precursor peptide and tends v.: accumulate as insoluble 
aggregates in . the periplasm- unless non-metabolizable 
sugars are present in the r-.ccium (Scherrer et al. _Aod^ 
Microbiol Biotechnol. 42:85-91 (1994)). Evolution of 
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. this enzyme through the methods of the instant invention 
could be used to -generate an enzyme that folds better, 
leading to a higher level, of active enzyme expression. 

In yet' another example, ■penicillin G acylase 
covalently linked to agarose is used in the synthesis of 
penicillin G derivatives.' The enzyme can be stabilized 
for increased activity, longevity and/or thermal 
stability by chemical modification {Fernandez-Laf uente 
et, al. ' Enzyme Microb, Technol . 14:489-495 (1992) . 
Increased thermal stability is an especially attractive 
application of the recursive sequence recombination 
techniques . of the instant invention, which can obviate 
the need for the chemical modification of such enzymes. 
Selection for thermostability car. be performed in vivo in 
E. coli or in thermophiles at higher temperatures. In 
general, thermostability is a coed first step in 
enhancing general stabilization of enzymes. Random 
'mutagenesis- and" select ion can also, be used to adapt 
enzymes to function in non-aqueous solvents (Arnold Curr . 
Ooin Blo.technol , 4.:450-455 -(1 993) ; Chen et. -al .' Proc . , 
Natl. Acad. Sci . U.S.A. •/ 90 : 56 16 - 5622 (1993) ) . Recursive 
sequence' recombina t ion represents a more powerful (since' 
recombinogeni.e ) method of generating mutant -enzymes that, 
are stable and active in nor. -aqueous environments. 
Additional screening car. 'be done- on the basis of enzyme 
stability- in solvents. . . 

C. Polyke t ides - \ 

Polyketides include 'antibiotics such a.s 
tetracycline '.'and erythromycin,' an t i - canter agents such as 
daunomycin, immunosuppressants such as .Fro 06 and 
rapamycin and veterinary products such as monesin and 
avermectin.* Polyketide synthases (PKS's) are 
multifunctional enzymes that control the chain length, . 
choice of chain-building units and reductive cycle that 
generates the huge variaticn in naturally occurring 
polyketides. Polyketides are built up by sequential 
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transfers of "extender units" (fatty acyl CoA groups) 
onto the appropriate starter unit (examples are acetate, 
coumarate, propionate and malonamide) . The PKS's 
determine the number of condensation reactions and the 
type of extender groups added and may also fold and 
cyclize the polyketdde precursor. PKS's reduce specific 
0-keto groups and may dehydrate the resultant /?-hydroxyls 
to form double bonds. Modifications of the nature or 
number of building blocks used, positions at which 0-keto 
groups are reduced, the extent of reduction and different 
positions of possible cyclizations , result in formation 
of different final products. Polyketide research is 
currently focused on modification and inhibitor studies, 
site directed mutagenesis and 3-D structure elucidation 
to lay the groundwork for rational changes in enzymes 
that will lead to new polyketide products. 

Recently, McDaniel et al. ( Science 262:1546- 
1550 (1995)) have developed a S zrepcomyces host-vector 
system for- efficient construction and expression of .i 
recombinant PXSs . Hutchinson ( = io/Techno1ncv 12:375-308' .. 
(1994)) reviewed targeted mutation of specific 
biosynthetic genes and suggested that microbial isolates "-• 
can oe screened by D.'.\-\ hybridization for genes associated'-' 
with known pharmacologically active agents so as to 
provide new metabolites -or increased yields of 
metabolites already being produced. In particular, that 
review focuses ' on' pblyket ice synthase and pathways to" 
aminoglycoside and oligopeptide antibiotics. 

The recursive sequence recombination techniques 
of the instant invention can be used to generate modified 
enzymes that produce novel polyketides without such 
detailed analytical effort. The availability of the PKS 
genes on plasmids and the existence 'of £ . col'i- 
Streptomyces shuttle vectors (Wehmeier Gene 165: 14 9-150 
(1995)) makes the process of recursive sequence 
recombination especially attractive by the techniques 
described above. Techniques for selection of antibiotic ' 
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producing organisms can be used as described above; 
additionally, in some embodiments; screening for a 
particular desired polyketide activity or compound is ; 
preferable. 

D. Isoprenoids 

Isoprenoids result from cyclization of farnesyl 
pyrophosphate by sesquiterpene synthases . '; The diversity 
of isoprenoids is generated not by the backbone, but by 
control of cyclization. Cloned examples of isoprenold .• ■ 
synthesis genes include trichodiene synthase from 
Fusarium sprorozrichioides pentalene synthase from 
Streptomyces , aris tolochene synthase from Penicill ium 
roquefortii, and epi -aristoiochene synthase from N. 
tabacum (Cane, D.E.' (1995). Isbprenoid antibiotics/ 1 
pages 633-655, in "Genetics and Biochemistry of 
Antibiotic Production" edited oy Vining, L..C. Sc.. Stuttard, 
C. , published by 3utterwort h -He memann) . Recursive 
sequence recombination of -sesquiterpene synthases will be 
of use both in allowing expression of these enzyrr.es. in 
heterologous hosts (such as .planes and industrial, 
microbial strains) and in alteration of enzymes to 
change the eye 1 1 zed product -made. A large number of 
isoprenoids are a'ct.ive as antiviral antibacterial, 
antifungal, herbicida'i, ■ insecticidai or cytostatic 
agents. Antibacterial and antifungal isoprenoids could 
thus be preferably screened cor using the indicator cell 
type system, described above, with the producing, cell 
competing with bacteria or fungi, for 'nutrients '. 
Antiviral, isoprenoids could be screened for preferably by 
their 'ability to confer resistance 'to viral attack on. the 
producing ceil. 

E . Bi'oactive Peptide Derivatives 
Examples of bioactive non - r ibosomal ly 

synthesized peptides include the antibiotics cyclosporin, 
pepstatin, act inomycin, gramicidin , depsi pep tides , 



WO 97/35966 PCT/US9704715 



57 



vancomycin, etc. Theie peptide derivatives are 
synthesized by complex -enzymes rather than ribosomes. 
Again, increasing the yield of such non-ribosomally 
synthesized peptide antibiotics has. thus far been done by 
genetic identification of biosynthetic "bottlenecks" and 
over expression of specific enzymes (See, for example, p. 
133-135. in "Genetics and Biochemistry of Antibiotic 
Production" edited by Vining, L.C. & Stuttard, C. , 
published by Butterworth-Heinemann) . Recursive sequence 
recombination of the enzyme clusters can be used to 
improve the yields of existing bioactive non-ribosomally 
made peptides in both natural and heterologous hosts. 
Like polyketide synthases, peptide synthases are modular 
and multifunctional enzymes catalyzing condensation 
reactions between activated building blocks (in this case 
amino acids) followed by modifications of those building..' 
blocks (see Kleinkauf, H. and von Dohren, K. Eur. J. k 
3iochgr? - 236:335-351 (1996)). Thus, as for polyketide - 
synthases, recursive sequence recombination can also be..^. 
used to alter peptide synthases: modifying the 
specificity of the amino acid recognized by each binding-, 
site on the enzyme and altering the activity or substrate, 
specificities of sites that modify these amino acids to v. 
produce' novel compounds with antibiotic activity. 

Otner peptice antibiotics are made ribosomally 
and then .post - transiat ionaliy modified. Examples of 'this 
type of antibiotics are i.an.tibiot ics (produced by gram" - 
positive bacteria such Staphylococcus, Szrepconyces , 
3acillus, and Accinoplar.es) and microcins (produced by 
Enterobacteriaceae) . ' Modifications of ' the original 
peptide include (in iantibiotics) dehydration of serine 
and threonine, condensation of cehydroamino acids .with 
cysteine, or simple N - and C- terminal blocking 
(microcins). For ribosomally made antibiotics both the 
peptide-encoding sequence and the modifying enzymes may 
have their expression levels modified by recursive 
sequence recombination. Again, this will lead to both 
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increased levels of antibiotic synthesis, and by 
modulation of the levels of the modifying enzymes (and 1 
the sequence of the ribosomally synthesized peptide 
itself), novel antibiotics. 

Screening can be done as for other, antibiotics 
as described above , ■ including competition with a 
sensitive (or even initially insensitive) microbial 
species. Use of competing bacteria that have resistances 
to the antibiotic being produced will select strongly 
either for greatly elevated levels of that antibiotic ■ (so 
that it swamps out the resistance mechanism) or for novel 
derivatives , 6f that antibiotic that are' not neutralized 
by the resistance mechanism. 

F . Polymers 1 

Several examples c: metabolic engineering to 
produce biopolymers have beer, reported, including the 
production of the- biodegradable plastic 

polyhydroxybutarate (?K3), and' the polysaccharide xanthan 
cum. For a review, see Cameron et al . Aoolied 3iochem. 
Biotech . 3 8 : 105 - 14 0 ( i S 3 ; 3 ) .Genes for these pathways 
have been cloned"/ making them excellent candidates .for 
the recursive 'sequence recombinat ion techniques described 
above. Expression of such, evolved genes in a 
commercially viable host such as E. coli is an especially 
attractive application o: this technology . 

Examples of starting materials for recursive 
sequence recomubinat ion include but are not limited to 
aenes from bacteria such 'as Alcaligenes , Z'oogloea, 
Rhi zobiun}, Bacillus ,: .and A-zocac cer , which produce 
poiyhydroxyalkanoates (?HAs) such as polyhyroxybuty rate 
* (?HB) ihtracellularly as energy reserve materials in . 
response to stress. Genes 'f rem Alcaligenes eucrdphus . 
that encode enzymes catalyzing .the conversion of 
acetoacetyl CoA to PH3 have been transferred both to E. 
coli and to the plant Arab: zcpsis chaliana 
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(Poirier et al. Science 256:520-523 (1992)). Two of these 
genes (phbB and phbC, encoding acetoacetyl-CoA reductase 
and PHB synthase respectively) allow production of PHB' in 
Arabidopsis. The plants producing the plastic are 
stunted, probably because of adverse interactions between 
the new metabolic pathway and the plants' original 
metabolism (i.e., depletion of substrate from the 
mevalonate pathway) . Improved production of PHB in 
plants has been attempted by localization of the pathway 
enzymes to organelles such as plastids. Other strategies 
such as regulation of tissue specificity, expression 
timing and cellular localization have been suggested to 
solve the deleterious effects of PHB expression in 
plants. The recursive sequence recombination techniques 
of the invention can be used to modify such heterologous 
genes as well as specific cloned interacting pathways 
(e.g., mevalonate), and :c optimize PHB synthesis in 
industrial microbial strains, for example to remove the 
requirement for stresses (such 35 nitrogen limitation) in 
growth conditions. 

Additionally, ocner microbial polyesters are 
mace by different bacteria ;:; which additional monomers ■ 
are incorporated into the polymer {Peoples et ai. in 
Novel Biodegradable y.i crobi a Polymers , EA Dawes, ed . , 
PP191 - 202 (1990)),. Recursive sequence recombination cf 
these genes or pathways singly or in combination into a 
r.e-t-e ro-loccus host will allow the production of- a variety ■ 
o z pel yme rs with differing properties, including 
variation of the monomer subunit ratios in the polymer , • 
Another polymer whose synthesis may be manipulated by 
recursive sequence recombination is cellulose. The genes 
for cellulose biosynthesis have been cloned from 
Agrobac Ceri um cumefaci er.s (Mat t hysse , A . G . et . al . J . 
Bacteriol . 177:1069-1075 ; 1 9 ? 5 } ) . Recursive sequence 
recombinat ion of this biosynthetic pathway could be used 
either to increase synthesis of cellulose, or to produce 
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mutants in which alternative sugars are incorporated into 
the polymer. 

G. Carotenoid3 
, Garotenoids are a- family of over 600 terpenoids 
produced in the general isoprenoid biosynthetic pathway 
by bacteria, fungi and plants (for a review, see 
Armstrong, J. Bact . 176:4795-4802* (1994)). These 
pigments protect organisms against photooxidat ive damage 
as well as functioning as anti-tumor agents, free 
radical -scavenging anti -oxidants , and enhancers of the 
immune response. Additionally, they are used 
commercially in pigmentation of cultured fish and 
'.shellfish. ■ Examples of carotenoids include but are not 
limited to myxobacton, sphero'idene , spheroidenone , 
lutein, astaxanthin, violaxanthin , 4 - ketoruiene , 
'myxoxanthrophyl'l , echinencne., iycopene, zeaxanthin and 
its mono- and ci- clucosices, ci- 4 -Q~, y- and 6-carotene, 
^-cryptoxanthin monogiucos ide and • neoxan-t-hin . ■ ■ 

Carpter.oic synthesis .is catalyzed by relatively 
small numbers of clustered ger.es: 11 different genes 
within 12 .kb 'of ON A: fro™ Xyxc coccus' xar. chus (Bote 11a et 
5 — Eur. J. 3ibchem. 233:238-24 8 ( 1.995}) and 3 genes 
within 9 kb of DNA from Rhcdcbaccer sphaeroides (Lang.-'et. 
a'l'.. J ... 3a.cz . 177 : 2064 -2073 (1 995);. In some 
microcrganisms^ such as Therr.us the rrn oph i lus f these genes 
ar'e. plasmid-borne (Tabata et al . FES'S Letts 341:251-2,55 
(' 19 9 4 > " ) . These . features 1 make carotenoid ' synthetic 
pathways especially attractive candidates' for .recursive, 
sequence ' recorabina t-icn . •" 

Transfer 'of some carotenoid genes into 
heterologous organisms results in exoression. " For 
example, genes from Erwina uredovora and Haema Lococcus 
pluvialis will function together in E . cbli (Kajiwara et 
si. Plant Mol . Biol . 29:343-352 (1995)). E. herbicola 
genes will function in R. sphaeroides (Hunter et al . J . 
Sact . 176:3692-3697 (1994)). However, some other genes 
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do not; for example," .R. capsulatus genes do not direct 
carotenoid synthesis in E. coli (Marrs, J, Bact. 
146:1003-1012 (1981) ).■'"' 

In an embodiment of the invention, the 
recursive sequence recombination techniques of the 
invention can be used to generate variants in the 
regulatory and/or structural elements, of genes in 'the 
carotenoid synthesis pathway, allowing increased 
expression in heterologous hosts. Indeed, traditional 
techniques have been used to increase carotenoid 
production by increasing expression of a rate limiting 
enzyme' in Therwus thehnophilus (Hoshino et al . add! . 
Environ. Micro 59:3150-3153 (1993)). Furthermore, 
mutation of regulatory, genes can cause constitutive 
expression of carotenoid synthesis in accinomycetes , 
where carotenoid photoinducibility is otherwise unstable 
and lost at a relatively high frequency in some species 
(Kato et al. Mol . Gen. G*r*~ 2 \1 :387-390 (1995)). These 
are both mutations that can be obcained by recursive 
sequence recombination. 

The recursive sequence recombination" techniques 
of the invention as described above can be used to evolve 
one or more carotenoid synthesis genes in a desired host 
without the need for analysis of regulatory mechanisms. 
Since carotenoids are colored, a colorimetric assay in 
microtiter plates, or even on growth media plates, can be 
used--for- screening for increased production." 

In addition to increasing expression of 
carotenoids, carotenogenic biosynthetic pathways have the 
potential to' produce a wide diversity of carotenoids, as 
the enzymes involved appear to be specific for the type 
of. reaction they will catalyze ,\ out not for the substrate 
that they modify. For example, two enzymes from the 
marine bacterium Agrobacceriv- auranciacun (CrtW and 
CrtZ) synthesize six different ketocarotenoids from 
0-carotene (Misawa et ai . J. 3acr. . 177:6576-6584 (1995)) . 
This relaxed substrate specificity means that a diversity 
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of substrates can be transformed into an even greater 
diversity of products. Introduction of. foreign 
carotenoid genes into' a cell can lead to novel and . 
functional carotenoid-protein complexes, for example in, 
photosynthetic complexes - (Hunter et al . J; Bact . 176:3692- 
3697 (1994)). Thus, the deliberate recombination of 
enzymes through the recursive sequence recombination 
techniques of the invention is likely to generate novel 
compounds. Screening for such compounds can be 
accomplished, for. example, by the cell 

competition/survival techniques discussed above- and by a 
colorimetric assay for pigmented compounds. 

Another method of identifying new compounds is 
to use standard analytical techniques such as mass 
spectroscopy, nuclear magnetic resonance, high 
performance liquid chromatography, etc. Recombinant 
microorganisms can be ■ pooled, and extracts or media 
supernatant s assayed from bhese pools. Any positive pool 
can then be subdivided and the procedure repeated until 
the, single positive is identified ( "sib -select ion" ) . 

* Indigo Biosynthesis - 

Many dyes,' i.e. agents for • impart ing color, are 
specialty chemicals with significant markets. As an 
example, indigo is currently produced . chemica 1 ly . 
However, nine genes have been combined in E. col : to 
allow the synthesis of indigo, from glucose via the 
tryptophan/ indole pathway (Murdoch et a,l . Rio/Technology 
11:. 381-386 (1993)). A "number of manipulations were 
performed to optimize indigo- synthesis: cloning of nine 
genes, modification of the fermentation medium and 
directed changes in two cperons to increase reaction 
rates and catalytic activities of several enzymes. 
Nevertheless, bacteria 1 ly produced indigo is not 
currently an economic proposition. The recursive 
sequence recombination techniques of the instant 
invention could be used to optimize indigo synthesizing 
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enzyme expression levels and catalytic activities, 
leading to increased indigo production, thereby making 
the process commercially viable and reducing the 
environmental impact of indigo manufacture. Screening 
for increased indigo production can be done by 
colorimetric assays of cultures in microtiter plates. 

I . Amino Acids . 

Amino acids of particular commercial imoortance 
include but are not United to phenylalanine, monosodium 
glutamate, glycine, lysine, threonine, tryptophan and 
methionine. Backmanetal. ( Ann. NY AraH o< » fl . 1ff ^ 
(1990)) disclosed the enhanced production of 
phenylalanine in E . coli via a systematic and downstream 
strategy covering organism -selection, optimization of 
biosynthetic capacity, ar.d development of fermentation 
and recovery processes. 

As described i.-. Simpson et al . Qiochem fin'r . ' 
Trans, 23:331-387 (1995)), current work in the field of \- 
amino acid production is focused on understanding the 
regulation of' these pathways in- great ■ molecular detail. 
The recursive sequence recombination techniques of the 
instant' invention would obviate the need for this 
analysis to obtain bacterial' strains with higher secreted - 
amino acid yields. Amino acid production could be 
optimized for expression using recursive sequence 
recombination of the amino acid synthesis and secretion 
•genes as well as enzymes at the regulatory 
phosphoenolpyruvate branchpoint, from such organisms as 
Serracia warcescens, Bacillus, and the Corynebacteriua 
-Brevibacterium group. In some embodiments of the 
invention, screening for enhanced production is 
preferably done in microtiter wells, using chemical tests 
well known in the art that are specific for the desired 
amino acid. Screening/selection for amino acid synthesis 
can. also be done by using auxotrophic reporter cells that 
are themselves unable to synthesize the amino acid in ' ^ 
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.question. If these reporter cells also produce a 
compound that stimulates the growth of the amino acid 
producer (this could be a growth factor, or even a 
different amino acid) , then library cells, that produce 
more amino acid will in turn receive more growth 
stimulant and will therefore grow more rapidly. 

J . Vitamin C synthesis 

L-Ascorbic acid (vitamin C) is- a commercially . 
important vitamin with a world production of over 35,000 
tons in 1984. Most vitamin C is currently manufactured 1 
chemically by the Reichstein process, although recently 
bacteria have been engineered that are able to transform 
glucose to. 2 , 5 -keto-gluconic acid , and chat product to 2- 
ke to- L- idonic acid, the precursor to L- ascorbic acid 
(3oudrant, Snzvme Micro's . Tec. hr.ol . 12:322-329 (1990)). 

The ef f iciehcies o: these enzymatic steps in.' 
bacteria are currently low. Using, the recursive sequence 
recombination techniques of the 'instant invention, the 
genes can be genetically er,c in eered to create one or more 
opercns followed by expression optimization of such a 
hybrid L- ascorbic acid synthetic pathway to result in 
commercially viable microbial vitamin C biosynthesis. In 
some embodiments, screening fcr enhanced L- ascorbic acid 
production is preferably done in microtiter plates, using' 
as'says well known in 'the art. 

V 1 • Modification of Cell Properties. ' 

Although not strictly examples of manipulation 
of intermediary metabolism, recursive sequence* ' . 
recombination techniques can be used to improve or alter, 
other aspects of cell properties, from growth rate, to 
ability to secrete certain desired compounds to ability 
to tolerate increased temperature or other environmental 
stresses. Some examples of traits engineered by 
traditional methods include expression of heterologous 
proteins in bacteria, yeast, and other eukaryotic cells, 
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antibiotic resistance, and phage, resistance . Any of 
these traits, is advantageously evolved by .the recursive 
sequence recombination techniques of the instant 
invention. Examples include replacement of one nutrient 
5 uptake system (e.g. ammonia in Methylophilus 

methylotrophus) with another that is more energy 
efficient; expression of haemoglobin to improve growth 
under conditions of limiting oxygen; redirection of toxic 
metabolic end products to less toxic compounds; 
10 expression of genes conferring tolerance to salt, drought 
and toxic compounds and resistance to pathogens, 
antibiotics and bacteriophage, reviewed in Cameron et. 
al. Apol Biochem Biotechnol , 38:105-140 (1993). 

The heterologous genes encoding these functions 
15 all have the potential for further optimization in their 
new hosts by existing recursive sequence recombination 
technology. Since these functions increase cell growth 
rates under the desired growch conditions, optimization 
of, the genes by evolution simply involves recombining the^ 
20 DN'A recursively and selecting the recombinants' that grow 
faster with limiting oxygen, higher toxic compound 
concentration, or whatever is the appropriate growth 
condition for the parameter being improved. 

Since these functions increase ceil growth 

2 5 rates under the .desired growth conditions, optimization 

of the genes by "evolution" can simply involve 
"s'huf fling " the DK A and" selecting the "recombinants that 
grow faster with limiting oxygen, higher toxic compound 
concentration or whatever restrictive condition is being 

3 0 overcome . 

Cultured mammalian cells also require essential 
amino acids to be . present in the growth medium. This 
requirement could also be circumvented by expression of 
heterologous metabolic pathways that synthesize these 
35 , amino acids (Rees et al . Biotechnology 8:629-633 (1990). 
Recursive sequence recombination would provide a 
mechanism for optimizing the expression of these genes in 
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mammalian cells. Once again, a preferred selection would 
be for cells that can grow in the absence of added amino 
acids . 

Yet another candidate for- improvement through 
the techniques of the invention is symbiotic nitrogen 
fixation. Genes involved in nodulation (nod, ndv) , 
nitrogen' reduction (nif,. fix)-, host range determination 
(nod, hsp) , bacteriocin production . (tfx) , surface 
'polysaccharide synthesis (exo) and energy utilization 
-(dot, hup) which have been identified ( Paau , Biotech . 
Adv . 9:173-184 (1991)). 

The main function of recursive sequence 
.recombination in this case is in improving the survival 
of strains that are already known to be better nitrogen 
fixers. , These strains tend to, be less good a: competing 
with strains - already, present: in the environment,' even 
though they are better ' at . nitrogen fixation.; 'Targets for 
recursive' sequence recombination such as nodulation and 
host range determination, genes can be modified 'and 
selected for' by their ability , to grow on the new host. 
Similarly' any bacteriocin or energy utilization' genes 
that will improve, the competitiveness of. the 'strain will 
also result in greater growth rates. Select ion can . 
simply be performed by subjecting the target genes to ■ 
recursive secruence • reccmbinat ion and forcing' the 
inoculanf ■ to compete with wild type nitrogen fixing 
bacteria.' The better, the nitrogen fixing bacteria grow ■ 
in the new host, the more, copies' of their recombined 
genes will be. present for the next round of 
recombination! This growth rate ,. differentiating 
selection is described above in detail. 

VI , Biode tec tors /Biosensors 

Bioluminescence cr fluorescence genes can be 
used as reporters by fusing them to specific regulatory 
genes (Cameron et. al . Aool Biochem Biotechnol . , 38 : 105- 
140 (1993)). A specific example is one in which the 
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lucif erase genes luxCDABE of Vibrio fischeri were fused 
to the regulatory region of the isopropylbenzene 
catabolism operon from Pseudomonas putida RE204. 
Transformation of this fusion construct into E. coli 
resulted in a strain which produced light in response to 
a variety of hydrophobic compound such as substituted 
benzenes, chlorinated solvents and naphthalene 
(Selifonova et. al., Aool Environ MicmMoT oo7a. 7ft i 
(1996)) . This type of construct is useful for the 
detection of pollutant levels, and has the added benefit 
of only measuring those pollutants that are bioavailable 
(and therefore potentially toxic) . Other signal 
molecules such as jellyfish green fluorescent protein 
could also be fused to genetic regulatory regions that 
respond to chemicals in the environment. This should 
allow a variety of rr.olecules to be detected by their 
ability to induce expression cf a protein or proteins : 
which result in light, fluorescence or some other easily 
detected signal. 

Recursive sequence recombination can be used in 
several ways to modify this type of biodetection system. • 
It can- be used to increase the amplitude of the response, - 
for example by increasing the fluorescence of the green' - 
fluorescent protein. Recursive sequence recombination, 
could also be used to increase induced expression levels 
or catalytic activities cf other signal-generating 
systems, "for example 'of ' the" lucif erase genes. " 

Recursive sequence recombination can also be 
used to alter the specificity of biosensors,. The 
regulatory region, and transcriptional activators that 
interact with this region and with the chemicals that 
induce transcription can also be .shuffled. This should 
generate regulatory systems in which transcription is 
activated by analogues cf the normal inducer, so that 
biodetectors for different chemicals can be developed. 
In this case, selection would be for constructs that are ' 
activated by the (new) specific chemical to be detected. 
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Screening could be. done simply with ' fluorescence (or 
light) activated cell sorting, ' since' the desired 
improvement is in light production. 

In addition to detection of environmental 
pollutants, biosensors can be developed that will respond 
to any chemical for which there are receptors, or for 
which receptors can be evolved by recursive sequence 
recombination, such as hormones, growth factors metals 
and drugs. These receptors may be intracellular and 
direct activators of transcription, or they may be 
membrane bound receptors that activate transcription of 
the 1 signal indirectly, for example by a phosphorylation 
cascade. They may also not act on transcription at all, 
but may produce a signal by ' some ■ post - transcriptional 
modification of a component of the signal generating 
pathway., These receptors may also be generated by fusing 
domains responsible ■ for binding different iigands with' 
different signaling domains. 'Again, recursive sequence 
recombination can be. usee, to increase the amplitude of 
the signal' aenerated to optimize- expression and 
functioning of •chimeric: .-receptors, and to alter the 
specificity of the- chemicals detected ■ by the receptor . 

The following examples are offered' by way of 
illustration, not by way of limitation. 

-EXAMPLES 

I . Alteration of enzyme activity and specifi city. 

In this example, recursive sequence 
Recombination techniques of the -instant invention were 
used to expand 'the range of substrates efficiently . 
hydrolyzed by £ . coli p -ca lactos lease . The goal was to 
evolve wild type £. coli C -caiactosidase into -a 
fucosidase. The enzyme shewed very. weak activity with 
both p-nitrophenyl-/3-D- f uccpy ranosice and 

*o-nitrophenyl-/3-D-fucopyranoside (estimated respectively 
as 80- and 160-fold less efficient than for 
p~nitrophenyl-/?-D-galactopyranoside) . 
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To increase the activity of E. coli 
/?-galactosidase against these f ucopyranoside derivatives, 
a lacZ gene (a 3.8 kb Hind III -BamHI fragment from 
plasmid pCHHO, Pharmacia) encoding. E. coli 
/J-galactosidase was subcloned into plasmid pl8SFI-BLA-SFI 
(Stemmer, Nature, 370 .-389-391 (1994) ) . The resulting 
plasmid, pl8-lacZ, was used for recursive sequence 
recombination and mutant screening. 

Purified plasmid pl8-lacZ (4-5 /xg) was used 
directly for DNase I fragmentation. Fragments with sizes 
between 50' and 200 bp were purified from a 2* agarose gel 
and used for reassembly PCR (Stemmer, Nature 370:389-391 
(1994)). Assembly reactions used Tth ' polymerase (Perkin 
Elmer) in the manufacturer's supplied buffer. The PCR 
program for assembly was as follows: 94 °C, 2 min., then 
40 cycles of 94°C for 30 sec.; 55°C for 3 sec.; 72°C for >*■ 
1 min. - 5 sec. per cycle; -her. finally 72°C for S min. 

This reaction was di luted .100 - fold into a ?■ 
standard PCR reaction using the 40mer primers p50F ■■ i 

5 ' - AGCGCCCAAtACGCAJi^CCGCCTCTCCCCGCGCGTTGGCC - 3 ' 

and pR34 5 ' - CTATGCGGCATCAGAGCAGAT7GTACTGAGAGTGCACCAT- 3 ' . v. 
This resulted in amplification of bozh the desired DN T A J 
band (about 4 kb in size) as well as two smaller sized 
products (about 500b? -and lOObp bands). The PCR produces ' 
were digested with Sar^HI and Hind III and the correct, 
size product was cloned into 3amKI - Kind III digested 
piS-lacZ."" The result ing"*piasmid containing a pool of 
reconbined lacZ mutants was plated out on LB plates 
supplemented, with kanamycin and 

5-bromo-4-chloro-3-indolyl-/?-D-fucopyranoside (X-fuco) . 
Plates were incubated at 37°C for 20 hours and screened 
for .colonies with slight blue tint, indicating hydrolysis 
of the X-fuco. Plasmid DN"A was prepared from positive 
colonies and the procedure was repeated. Thus, six 
rounds of recursive sequence recombination produced a 
ten-fold increase in X-fuco hydrolysis activity. 



WO 97/35966 



PCT/US97/04715 



70 

II.- Evolution'of an Entire Metabolic Pathway 

As an example of evolution of an entire 
metabolic pathway; the recursive sequence recombination 
techniques of the invention were used to* modify, a plasmid 
encoding resistance to mercury sal ts . This plasmid, as 
disclosed by Wang et'al.' ( J. Bact : 17-1:83-92 (1989) ) 
contains at least 8 genes within 13.5 kb of Bacillus DMA 
inserted in the cloning vector pUC9. The recursive 
sequence recombination protocol used for this plasmid was 
as follows . , 

Plasmid DNA (at 130 /ig/ml) was digested with 
0.0 9 U/ml DNAse in 5 0 mM Tris-Cl, pH 7.4, 10- mM MnCl 2 , for 
10 minutes "at 25°C. DNA fragments were not size- 
selected, but were purified by phenol extraction and 
ethanol precipitation. The ' assembly reaction was' 
performed using Tth polymerase (Perkin Elmer) using, the 
manufacturer's .supplied, buffer, 1 supplemented with the 
following: 7.5V polyethylene glycol, 8000 Mw; 3 5 mM 
■te.tr a methyl ammonium chloride; and 4 U/ml Pwb (Boeh ringer 
Mannheim), Pf u (St ratagene ) ( Vent (New England Biolabs) , 
Deep Vent (Mew England'- 3ioiabs ) , Tfi (Promega) or Tli 
(Promega) 'thermostable DNA poi^erases. _ DNA fragments 
were used at around -10 <:g/nu. 

The PGR program, for assembly' was as follows: 
9 4 0 C for 20 sec,, then 40 .cycles of. 94 0 C. for 15 sec,, 
4 0 0 C for 3 0 sec., • 7 2 c C for, 3 0 sec. * 2 sec, /cycle, and 
finally- 72 °G" for 10' min. ' 

( The recombinant: plasmid was then amplified in 
three fragments by using primers flanking the three 
relatively' 'evenly ''spaced AlwNI restriction-, site-s '■ 
contained in the plasmid. ■ The sequences of these primers 
were : 

1) 5' -CAGGACTTATCGGGAGTGGGAGG- 3 ' 

2 ) 5 ' - CTCGCTCTGCTAATCCTGTTACC - 3 ' 

3 ) 5 ' - G G A T AT TAT GAG GG T T T AG G C T T AATTC G - 3 ' 

4 ) ■ 5 ' - CGGT ATCCTTTTTCCGT ACGTTC - 3 ' 

5 ) 5 ' - GTTGAAGAGGTG AAGAAAGTTCTCC - 3 ' 
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6 ) 5 ' - GT7CGTCGATTTCCACGC.TTGGC - 3 ' . 

Three fragments were amplified using primers 
1+4 (6 kb fragment), 2+5 (4 kb fragment) and 3+6 (6 kb 
fragment). These were then digested with AlwNI , gel 
purified and ligated together. As AlwNI is a 
non-palindromic cutter, the plasmid could only reassemble 
in the correct (original) order. The resultant plasmids 
were transformed into E. coli strain DHIOB (Gibco BRL) 
and selected on nutrient agar- containing ampicillin 
SO/ig/ml and increasing concentrations of mercuric 
chloride (IOOjiM to IOOOjxM) or phenylmercuric acetate 
(50MM to 400/.M) Thus, in 2 rounds of recursive seouence 
recombination the tolerance of E. coli to these compounds 
increased by a factor of 10. (from about 100 to about 
1 , 000 fiM) . 



111 ■. Recursive sequ e n ce R ecombi n*r i on of , a Family 

Related Enzymes 

In this exa~.pl e nucleotide sequences were 
recombined between four homologous p - lactamases from C. 
freundii, E. cloacae, K. pr.eu.~or.ia, and Y. 
encerocolicica.. The four genes were synthesized from 
oligonucleotides as described' in Stemmer,. et al ■ G °no 
164:49-53 (1995). Briefly, the entire coding sequences 
of the genes were synthesized as overlapping 50-mer 
oligonucleotides on a commercial oligonucleotide 
synthesizer. The oligonucleotides' were then assembled 
into full length genes by a standard recursive .sequence 
■recombination reaction, followed by amplification 'using 
primers common to all four genes. ' Oligonucleotides were 
designed to give optimal E . coli ccdon usage in the 
synthetic genes with the goal of increasing the homology 
to increase the frequency of recombination, and the same 
5' and 3' terminal sequences. After assembly of the 
genes and selection for active clones, which is optional, 
they were DNase treated to produce fragments from 50 to 
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• 200 bp in length. The fragments were dissolved at 100- 
jig/ml in 15 (jlI of Klenow (DNA polymerase I large 
.fragment) buffer, (New England .Biolabs) and subjected to 
manual PCR as follows: 15 cycles of 95°C for 1 min.; 
,5 .freeze on dry ice and ethanol,; warm to 25°C and add 2 pi 
of Klenow (1 U/aiI) in Klenow buffer; incubate for 2 min 
at 25°C. 

A 5 ill aliquot of the. manual PGR reaction was 
' then diluted 6-fold into a standard Taq reaction mix 
10 (without, oligonucleotide primers) and assembled using a 

■ standard PCR program consisting of 3 0 cycles of 94°C for 
30 sec, .40°C for 30 sec, and 72°C for 30 sec' 

A4, 8 or 16 aliquot of this second PCR 
reaction was then diluted into a standard Taq reaction 
15 nix containing oligonucleotide primers that prime on 
sequences contained in all four /3-lactamase genes 
5 ' - AGGGCCTCGTGATACGCC7A77 - 3 ' ar.d 

■ 5' - ACGAAAACTCACGTTAAGGGATT- 3 ' . Full-length product was 
amplified. using a standard ?CR program consisting of 25 

20 cycles of 94°C for 30 sec, 4S°C for 30 sec, 72'C for 45 
sec . 

This procedure produced hybrid j3- lactamase . 
genes whose activities can be tested against antibiotics 
including- but not limited -to ampicillinV carberiiciilin, 
25 cefotaxime, cefoxitine. cloxaciilin, ceftazidime, 
'cephalcricine and -oxaiactam, to determine the_ 
SDecif icities of the hybrid enzymes so created. 
. Kdxalactam was chosen as the test antibiotic for hybrid, 
genes.. .The best of the original /3-lactamase genes used 
30 . ' in this' study conferred resistance ' to- 0 .125 pg/ml of . • 
' .moxalactam.. After "the first'- round of recursive sequence 
recombination hybrid genes were isolated that -conferred 
resistance to 0 . 5/ig/ml "oxaiactan, yielding a 4-fold 
increase . 
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IV - Shuffling to Generate Improved Arsenate 

Detoxification Bacteria 

Arsenic detoxification is important for 
goldmining of arsenopyrite containing gold ores and other 
uses, such as environmental remediation. Plasmid pGJ103 , 
containing an .operon encoding arsenate detoxification 
operon (Wang et al . (1989) 3acteriol . 171: 83, 
incorporated herein by reference) , was obtained from • 
Prof. Simon Silver (U. of Illinois , Chicago, IL) . 
E. coli TGI containing pJG103, containing the pl258 ars 
operon cloned into pUC19, had a MIC (minimum inhibitory 
concentration) of 4 /ig/ml on LB amp plates. - The whole 
5.5 kb plasmid was fragmented with DNAse I into fragments 
of 100-1000 bp, and reassembled by PCR using the Perkin 
Elmer XL- PCR reagents. After assembling, the plasmid was 
digested with the unique restriction enzyme 3amHI . The 
full length monomer was purified from the agarose gel, 
iigated and electroporated into E. coli TGI cells. ' The 
tranf ormed cells were plates on. a range of sodium V 
arsenate concentrations (2, 4, e, 16 mM in round 1), -and 
approx. 1000 colonies fro™ the plates with the highest 
arsenate levels were pooled by scraping the plates. The 
ceils were grown in liquid in the presence of the same V: 
concentration of arsenate, arc plasmid was prepared from v - 
this culture. Round 2 and 3 were identical to round 1, 
except chat the cells were plated at higher arsenate 
levels. 5, 16, 32, 54 rrM were used for round 2; and 32, 
64, 128, 256 mM were used for selection of round 3, 

The best mutants crew overnight at up to 128 mM 
arsenate (MIC=255), a -64 - fold improvement. One of the 
improved strains showed that, the TGI (wildtype pGGJ103) 
grew in liquid at. up to : : 'my., whereas the shuffled 
TGI (mutant pGJ103) crew at 'up to 150 mM arsenate 
concentration. 

PCR program f or the assembly was 94 °C 20s r 
■50x(94°C 15s, 50°C 1 mm, 72°C 30s + 2s/cycle) / using ' a" 
circular PCR format without orimers. 
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Four cycles of the process " resulted: in a 50- 
100-fold improvement in the resistance to arsenate < 
conferred by the shuffled arsenate resistance operon; 
bacteria containing the improved, operon .'grew on medium 
containing up to 500mM arsenate. 

Although the foregoing invention has been, 
described in some detail by way of illustration and 
example for purposes of clarity of understanding, it, will 
be, obvious that certain changes and modifications may be 
practiced within the scope of the appended, claims. 

All references cited' herein are expressly 
incorporated in- their entirety for all purposes. 
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WHAT IS CLAIMED IS : 

1. A method of evolving a biocatalytic 
activity of a. cell, comprising: 

(a) recombining at least a first and second 
DMA segment from at least one gene conferring ability to 
catalyze a reaction of interest, the segments differing 
from each other in at least two nucleotides, to produce a 
library of recombinant genes; 

(b) screening at least one recombinant gene^ 
from the library that confers enhanced ability to 
catalyze, the /reaction of interest by the cell relative to 
a wi Idt ype form of the gene; 

(c) recombining at least a segment from the at 
least one recombinant gene with a further DNA segment 
from the at least one gene, the same or different from' 
the first and second segments, to produce a further 
library of recombinant cer.es ; 

(d) screening a: least one further recombinant 
gene from the further library of recombinant genes that 
confers enhanced ability to catalyze the reaction of 
interest by the cell relative to a previous recombinant 
gene ; , 

(e) repeating (c) and (d) , as necessary, until 
the further recombinant gene confers a desired level of 
ennancec ability to catalyze the reaction of interest bv - 
t he cell. 



2. The method of claim l, wherein the 
reaction of interest is the ability to utilize a 
substrate as a nutrient source. 

*3. The net hoc of claim i, wherein the 
reaction of interest is the ability to catabolize a 
comDound . 
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4. The method of claim 1; wherein Che 
reaction of interest is the ability to detoxify a . 

compound. 

5. The method of claim 1, wherein the 

' reaction of interest is the ability to synthesize a 
compound of interest. 

6> . T he method of claim 5, wherein the 
compound is an antibiotic. 

7, The method of claim 5, wherein the 
compound is an amino acid. 

8. The -method of claim 5, wherein the 
compound is a polymer. 

■ 9. ■ The method -of- claim 5, wherein the. 
compound is a carotenoi-d. 

10. The method of claim 5 wherein .the . 
compound is vitamin C. 

11. The method of. claim 5 f wherein the 
compound is indigo. • . t 

12 The method of ciairr. 3.,. wherein at lease ■ 

or.e reco~^inir.g see? is performed in vitro,, and the • 

Resulting library of recombinants is • introduced inio^the 

• cell whose biocataly tic act.ivi.ty is . to be enhanced * . 

' i rnv 2 -v n z ^»i v s containing different 
, generating a l.iDra^y o^ . - b - ■ 

\. recombinants. 



L 



13, The method of claim' 12, wherein the in 

2 vitro recombininc step comprises: 

3 ' cleaving the firs: and second segments into 

4 fragments; 
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mixing and '-denaturing ' the fragments ; and 
incubating the denatured fragments with a 
polymerase under conditions which result in annealing of 
the denatured fragments and . formation of the library of 
recombinant genes. 

14. The method' of claim 1, wherein at least 
one recombining step is performed in vivo.- 

15. The method of claim 1, wherein the 
recombining step is performed, in the cell whose 
biocatalytic activity is to be enhanced.' 

16. The method of claim 1, wherein at lease 
one DNA segment, comprises a. cluster of genes collectively 
conferring ' ability to catalyze a reaction of interest.. 

17. A method c: evolving a gene 'to confer 
ability to catalyze a reaction of interest, the method 
comprising : 

• ■ (1) recombining a: least- first and second DN T A 
segments from at. least one gene conferring ability to 
catalyze' a' reaction -c-f interest, the segments differing 
from each other in at least two nucleotides, to produce a 
library of recombinant genes; ' 

(2) screening a t leas t one re c ombi nan t gene 

- - from— the -Libra r-y-~t hat -cdnf ers~- enhanced- -ab:ii-i-tv -t-c ■ --■ : 

catalyze a reaction of interest relative to- a. wildtype 
■form of the gene; 

(3) recombining at least a segment .from the-. at 
least one recombinant gene .with a ■ further .DNA segment* 
from the at least one .gene, the s.ame or different from 
the' first' and second segments-, to. produce a further- 
library of recombinant genes; * 

(4) screening at least one further, recombinant 
gene, from the further- library of : recombinant genes that 
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20 confers enhanced, ability 'to,- catalyze a reaction of 

21 interest relative to a previous recombinant t,ene; 

22 (5) repeating ,(3) and (4) , as necessary, until 
23, the further recombinant gene confers a desired level of 

24 enhanced ability to catalyze a reaction of interest. 

1 18. A method of generating a new biocatalytic ' 

2 activity in a cell-, comprising: 

3 (1) recombining at least first and second DMA 

4 segments from at least one gene conferring ability to 

5 catalyze a first reaction related to a second reaction of 
" 6 interest, the segments differing from' each other in at 

7 least two nucleotides, to produce a. library of 

8 . recombinant gene's ; 

9 (2) screening at least one * recombinant gene 

10 from the library that confers a new ability to catalyze 

11 .the second reaction of interest ; 

12 • . (3) . re combining a: least a segment, from at / " 

13 least one' recombinant gene with a further DNA segment 
.14 'from the at least one gene , the same or different from 

15 the first and second segments, to produce a further 

16. library of ' recombinant genes; 

17 / ' (4) screening at 'least one ' further recombinant 

18 'gene from the further library o f. recombinant genes that 

19 confers enhanced ability to catalyze the second reaction 
2 0 of interest -in the ceil relative to a previous 

2.1 recombinant gene; - 

22 (5) _ repeating' (3) and (.4)., as 'necessary , until 

,23 the further - recombinant gene confers a desired level of 

24 - enhanced 'ability to catalyze the- second reaction of-' 

25 interest in the cell. 

1 19. A modifier form of a cell, wherein the 

2 modification comprises a -eta bo lie pathway evolved by 

3 recursive sequence recombination. 



SNSCCOD: <v;C 3;35?*£ai> 
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1 . 20 • A method of optimizing expression of a 

2 gene product, the method comprising: 

(1) recombining at least first and second DNA 
segments from at least one gene conferring ability to 
produce the gene product, the segments differing from 
each other in at least two nucleotides, to produce a 

7 . library of recombinant genes; 

8 {2) screening at least one recombinant gene 
from the library that confers optimized expression of the 
gene product relative - to a wildtype form of the gene; 

(3) recombining at least a segment from the at 
least one recombinant gene. with a further DNA segment 
from the at least one gene, the same or different from 
the first and second segments, to produce a further 

15 library of recombinant genes.; 

16 (4J screening at: least one further recombinant 

17 gene from the further library of recombinant genes that 
confers optimized ability :o produce the gene product 
relative to a previous recombinant gene; 

(5). repeating \2) and (4), as necessary, until 
the further recombinant cene confers a desired ' level of 
22 optimized ability to express the cene oroduct. 



2 - • ine "etnoc 1 of 'claim 20, wherein the at 
.east one cene encodes the cene oroduct. 



•- ■■ ^2-v"- -rhe- me t hod— of cialrv-2'0, wherein the" at 
least one gene is a vector comprising a gene encoding th- 
gene product. 



9 
10 
11 
12 
13 
14 



18 
19 
20 
21 



1 
2 

1 



.23. The method cf claim 20, wherein at lea; 
one recombining step is performed in vivo. 



24. The method of claim 23, wherein the 

2 recombining step is performed in a host ceil wherein th. 

3 gene product is exoressed. 
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.1- 25. The method of claim 20, wherein the at 

2 least one gene is a host cell gene and wherein the host 

3 cell gene does not encode the gene product. 

1 26. The method of claim 20, wherein 

2 optimization results in increased expression of the aene 

3 . product . ,. 

•1 27. A. method of evolving a biosensor for a 

2 compound A of interest, the method comprising: 

3 (1) recombining at least first and' second DNA 
• 4 segments from at least one gene ' conferring ability to 

5 detect, a related compound 3, the segments differing from 

6 each other' in at least two nucleotides, to produce a 
■7 library of recombinant genes; 

8 - (2) screening at least one recombinant gene 

9 from .che library that confers optimized ability to detect 

10 compound A relative to a wildtype form* of the gene; 

11 (3) recombining - at least a segment from the at 

12 least- one recombinant ger.'e with a further DNA segment 

13 from the at least one . gene , the same or different 'from 

14 the first' and second segments, -.to produce a further 

15 library of recombinant genes; 

.16 (4) 'screening at least, cne further- recombinant 

17 gene' from, the further library of recombinant genes that 

13- confers opt imi z ed ability to detect compound A relative 

19 • to a previous* recombinant' dene; 

20 - (5) repeating (3)' 'and (4), as necessary, until 

21 #>the further recombinant gene confers a desired . level of 

22 optimized ability to detect compound A. 

1 28. The method of claim 27, wherein 

2 optimization results in increased, amplitude of response 

3 by the biosensor. 

1 29. The method of claim 27, whe rein compound A 

2 and compound 3 are different. 
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1 30 • The method -of claim 27, wherein compound A 

2 and compound B are identical. 
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